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1. Foundations: atoms and molecules 

Classical Statistical Mechanics studies properties of macroscopic aggregates of parti- 
cles, atoms and molecules, based on the assumption that they are point masses subject 
to the laws of classical mechanics. Distinction between macroscopic and microscopic is 
evanescent and in fact the foundations of statistical mechanics have been laid on proper- 
ties, proved or assumed, of few particles systems. 

Macroscopic systems are often considered in stationary states: which means that 
their microscopic configurations follow each other as time evolves while looking the same 
macroscopically. Observing time evolution is the same as sampling ("not too closely" 
time-wise) independent copies of the system prepared in the same way. 

A basic distinction is necessary: a stationary state can be either in equilibrium or not. 
The first case arises when the particles are enclosed in a container O and are subject only 
to their mutual conservative interactions and, possibly, to external conservative forces: 
typical example is a gas in a container subject forces due to the walls of Vt and to gravity, 
besides the internal interactions. This is a very restricted class of systems and states. 

A more general case is when the system is in a stationary state but it is also subject 
to non conservative forces: a typical example is a gas or fluid in which a wheel rotates, as 
in the Joule experiment, with some device acting to keep the temperature constant. The 
device is called a thermostat and in statistical mechanics it has to be modeled by forces, 
non conservative as well, which prevent an indefinite energy transfer from the external 
forcing to the system: such transfer would impede reaching stationary states. For instance 
the thermostat could be simply a constant friction force (as in stirred incompressible liquids 
or as in electric wires in which current circulates because of an electromotive force) . 

A more fundamental approach would be to imagine that the thermostatting device is 
not a phenomenologically introduced non conservative force (like a friction force) but is 
due to the interaction with an external infinite system which is in "equilibrium at infinity" . 

In any event non equilibrium stationary states are intrinsically more complex than 
equilibrium states. Here attention will be confined to equilibrium statistical mechanics of 
systems of N identical point particles Q = (qi, . . . , q^r) enclosed in a cubic box O, with 
volume V and side L, usually supposed with perfectly reflecting walls. 

Particles of mass m located at q, q' will be supposed to interact via a pair potential 
</?(q — q'). Microscopic motion will follow the equations 

N 

rnq l = ~Y,9c l M^-^) + J2 WwaU ^ d - ~ 9 *MQ) (1-1) 

j-l i 

where the potential ip will be supposed smooth except, possibly, for |q — q'| < ro where it 
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could be +00 meaning the particles cannot come closer than tq and at ro (1.1) is interpreted 
by imagining that they undergo elastic collisions; the potential W wa u models the container 
and it will be replaced, unless explicitly stated, by an elastic collision rule. 

The time evolution (Q, Q) — > 5t(Q,Q) will, therefore, be described on the space 
J-{N) of the pairs position-velocity of the N particles or, more conveniently, on phase 
space: i.e. by a time evolution St on the space J-'(N) of the pairs (P, Q) momentum- 
position with P = mQ. The motion being conservative, the energy JJ d = J2i 2^ Pi + 

def 

Y.i<j ^(q« - Qj) + W wa ii(q.i) = K(P) + $(Q) will be a constant of motion; the last 
term in $ is missing if walls are perfect. This makes it convenient to regard the dynamics 
as associated with two dynamical systems (J-(N), St) on the 6iV dimensional phase space, 
and (J-'u(N), St) on the 6iV — 1 dimensional surface of energy U. Since the dynamics (1.1) 

is Hamiltonian on phase space, with Hamiltonian H(P, Q) = + ^(Q) d — K + 

it follows that the volume <i 3Ar P<i 37V Q is conserved {i.e. a region E has the same volume 
as S t E) and also the area S(H(P, Q) - U)d 3N Pd 3N Q is conserved. 

The above dynamical systems are well defined, i.e. St is a map on phase space globally 
defined for all t G (—00,00), when the interaction potential is bounded below: this is 
implied by the a priori bounds due to energy conservation. For gravitational or Coulomb 
interactions a lot more has to be said, assumed and done in order to even define the key 
quantities needed for a statistical theory of motion. 

Although our world is 3-dimensional (or at least was so believed to be until recent 
revolutionary theories) it will be useful to consider also systems of particles in dimension 
d 7^ 3: in this case the above 6N and 3iV become, respectively, 2dN and dN. Systems 
with dimension d = 1, 2 are in fact sometimes very good models for thin filaments or thin 
films. For the same reason it is often useful to imagine that space is discrete and particles 
can only be located on a lattice, e.g. on Z d , see Sect. 15. 



Bibliography: [Ga99]. 



2. Pressure, temperature and kinetic energy 



The beginning was Bernoulli's derivation of the perfect gas law via the identification 
of the pressure at numerical density p with the average momentum transferred per unit 
time to a surface element of area dS on the walls: i.e. the average of the observable 
2mv pv dS with v the normal component of the velocity of the particles which undergo 
collisions with dS. If f(v)dv is the distribution of the normal component velocity and 
f(v)d 3 v = Yli f( v i) d 3 v, v = (i>i,i>2,i>3), is the total velocity distribution the average of 
is pdS given by 

dS^ 2mv 2 pf(v)dv = dS J mv 2 pf(v)dv = p 2 -dS J U 2 f{v)d 3 v = p\(^)dS (2.1) 
Furthermore f(-ff) was identified as proportional to the absolute temperature (jj) d — 
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const^T which, with present day notations, is written §(-j|) = ksT. The constant ks 
was (later) called Boltzmann's constant and it is the same constant for all perfect gases at 
least. Its independence on the particular nature of the gas is a consequence of Avogadro 's 
law stating that equal volumes of gases at the same conditions of temperature and pressure 
contain equal number of molecules. 

Proportionality between average kinetic energy and temperature via the universal con- 
stant ks became, since, a fundamental assumption extending to all aggregates of particles 
gaseous or not, never challenged in all later works (until quantum mechanics, where this 
is no longer true, see Sect. 26). 

Bibliography: [Ga99]. 

3. Heat and entropy 

After Clausius' discovery of entropy and to explain it mechanically, Boltzmann in- 
troduced the heat theorem, which he developed to full generality between 1866 and 1884. 
Identification of absolute temperature with average kinetic energy and the heat theorem 
can be considered the founding elements of statistical mechanics. 

The theorem makes precise the notion of time average and then states in great gen- 
erality that given any mechanical system one can associate with its dynamics four quan- 
tities U, V,p,T, defined as time averages of suitable mechanical observables (i.e. functions 
on phase space) so that when the external conditions are infinitesimally varied and the 
quantities U,V change by dU,dV respectively, the ratio d - "jP d — is exact, i.e. there is 
a function S(U, V) whose corresponding variation equals the ratio. It will be better, for 
the purpose of considering very large boxes (V — > oo) to write this relation in terms of 

intensive quantities u d = jj and v = -j^ as 

du + pdv 

— is exact 

T 

i.e. the ratio equals the variation ds of s(j^, -j^) = jfS(U, V). 

The proof originally dealt with monocyclic systems, i.e. systems in which all motions 
are periodic. The assumption is clearly much too restrictive and justification for it devel- 
oped from the early "non periodic motions can be regarded as periodic with infinite period" 
(1866), to the later ergodic hypothesis and finally to the realization that, after all, the heat 
theorem does not really depend on the ergodic hypothesis (1884). 

Although for a one dimensional system the proof of the heat theorem is a simple check 
it was a real breakthrough because it led to answering the general question of under which 
conditions one could define mechanical quantities whose variations were constrained to sat- 
isfy (3.1) and therefore could be interpreted as mechanical model of Clausius' macroscopic 
thermodynamics. It is reproduced in the next few lines. 

Consider a one-dimensional system subject to forces with a confining potential <p(x) 
such that Iv 3 '^)! > for \x\ > 0, (p"(0) > and ip(x) x ^oo> + oo. All motions are periodic 
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so that the system is monocyclic. Suppose that the potential <p(x) depends on a parameter 
V and define a state to be a motion with given energy U and given V; let 



U = total energy of the system = K + $ 

T = time average of the kinetic energy K, (K) , . 

F = the parameter on which (p is supposed to depend ^ ' ' 

p = — time average of dyp, —(dv<*p)- 

A state is thus parameterized by U, V; if such parameters change by dU, dV, respectively, 

def def 

let: dL = — pdV, dQ = dU +pdV. Then (3.1) holds. In fact let x±(U, V) be the extremes 
of the oscillations of the motion with given U, V and define S as: 

rx+(u,v) f (dU - d V (p(x)dV) -% 

S = 2 log / y/(U-<p(x))dx =► dS= ^ (3.3) 

Jx-(U,V) J A^f 

Noting that -4= = J -^dt, (3.1) follows because time averages are given by integrating 



with respect to ^= and dividing by the integral of 



Bibliography: [Bo84], [Ga99]. 



4. Heat theorem and ergodic hypothesis 



Boltzmann tried to extend the result beyond the one dimensional systems: for in- 
stance to Keplerian motions: which are not monocyclic unless only motions with a fixed 
eccentricity are considered. However the early statement that "aperiodic motions can be 
regarded as periodic with infinite period" is really the heart of the application of the heat 
theorem for monocyclic systems to the far more complex gas in a box. 

Imagine that the gas container O is closed by a piston of section A located to the 
right of the origin at distance L and acting as a lid, so that the volume is V = AL. 
The microscopic model for the piston will be a potential Tp{L — £) if x = (£, 77, Q are the 
coordinates of a particle. The function Tp{r) will vanish for r > ro, for some r$ L, and 
diverge to +00 at r = 0. Thus tq is the width of the layer near the piston where the force 
of the wall is felt by the particles that happen to be roaming there. 

The contribution to the total potential energy $ due to the walls is W wa ii = J2j ^p{L — 
£j) and dyTp = A~ x djjp; assuming monocyclicity it is necessary to evaluate the time 
average of 8l^(x) = djyV wa ii = — J2j ~ As time evolves the particles Xj with £j 
in the layer within ro of the wall will feel the force exercised by the wall and bounce back. 
One particle in the layer will contribute to the average of 8l^(x) the amount 

2 I 1 —Tp'iL - Qdt (4.1) 

total time J to 

if to is the first instant when the point j enters the layer and t\ is the instant when 
the ^-component of the velocity vanishes "against the wall". Since —Tp'(L — £j) is the 
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^-component of the force, the integral is 2m|£,-| (by Newton's law), provided > of 
course. 

Suppose that no collisions between particles occur while the particles travel within the 
range of the potential of the wall: i.e. the mean free path is much greater than the range 
of the potential Tp defining the wall. The collisions contributions to the average momentum 
transfer to the wall per unit time is therefore given by, see (2.1), f v>Q 2mv f(v) 

Pwall Av dv 

if Pwaii, f(v) are the average density near the wall and, respectively, the average fraction 
of particles with a velocity component normal to the wall between v and v + dv. Here 
p, f are supposed to be independent of the point on the wall: this should be true up to 
corrections of size o(A). 

Thus writing the average kinetic energy per particle and per velocity component, 
/ fv 2 f(v)dv, as i/T 1 (c.f. (2.1)) it follows that 

- (d V $) = PwallP- 1 (4-2) 

has the physical interpretation of pressure. The is the average kinetic energy per 

degree of freedom: hence it is proportional to the absolute temperature T {c.f. Sect. 2). 

On the other hand if motion on the energy surface takes place on a single periodic 
orbit the quantity p in (4.2) is the right quantity that would make the heat theorem 
work, see (3.2). Hence regarding the trajectory on each energy surface as periodic (i.e. the 
system as monocyclic) leads to the heat theorem with p, U, V, T having the right physical 
interpretation corresponding to their appellations. This shows that monocyclic systems 
provide natural models of thermodynamic behavior. 

Assuming that a chaotic system like a gas in a container of volume V will satisfy 
"for practical purposes" the above property a quantity p can be defined such that dU + 
pdV admits the inverse of the average kinetic energy (K) as an integrating factor and 
furthermore p, U, V, (K) have the physical interpretations of pressure, energy, volume and 
(up to a proportionality factor) absolute temperature. 

Boltzmann's conception of space (and time) as discrete allowed him to conceive the 
property that the energy surface is constituted by "points" all of which belong to a single 
trajectory: a property that would be impossible if phase space was really a continuum. 
Regarding phase space as consisting of a finite number of "cells" of finite volume h dN , for 
some h > (rather than of a continuum of points) allowed him to think, without logical 
contradiction, that the energy surface consisted of a single trajectory and, hence, that 
motion was a cyclic permutation of its points (actually cells). 

Furthermore it implied that the time average of an observable F(P, Q) had to be 
identified with its average on the energy surface computed via the Liouville distribution 
C' 1 J F(P, Q)6(H(P, Q) - U) dPdQ with C = J 6(H(P, Q) - U) dPdQ (the appropriate 
normalization factor) : a property that was written symbolically ^ = J^-^q or 

r 1 f v(q(t> n\\* jF(P',Q')5(H(P',Q')-U)dP'dQ' 

t^tX mp,Q))dt= — IsmpfiQf) _ u)dpidQf — ( 4 - 3 ) 

The validity of (4.3) for all (piecewise smooth) observables F and for all points of the 
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energy surface, with the exception of a set of zero area, is called the ergodic hypothesis 



Bibliography: [Bo84], [Ga99]. 



5. Ensembles 



Eventually (1884) Boltzmann realized that the validity of the heat theorem for aver- 
ages computed via the r.h.s. of (4.3) held independently of the ergodic hypothesis, i.e. the 
(4.3) was not necessary because the heat theorem {i.e. (3.1)) could also be derived un- 
der the only assumption that the averages involved in its formulation were computed as 
averages over phase space with respect to the probability distribution in the r.h.s. of (4.3). 

Furthermore, if T was identified with the average kinetic energy, U with the average 
energy, p with the average force per unit surface on the walls of the container O with 
volume V, the relation (3.1) held for a variety of families of probability distributions on 
phase space besides the (4.3). Among which 



(a) the microcanonical ensemble, which is the collection of probability distributions in the 
r.h.s. of (4.3) parameterized by u = jf,v = (energy and volume per particle) 

<; (rf PdQ) = ^ N< V) 6(H( P , Q) - V) ™*, (5.1) 

where h is a constant with the dimensions of an action which, in the discrete representation 
of phase space mentioned in Sect. 4, can be taken such that h dN equals the cells volume 
and, therefore, the integrals with respect to (5.1) can be interpreted as an (approximate) 
sum over the cells conceived as microscopic configurations of N indistinguishable particles 
(whence the Nl). 

(b) the canonical ensemble which is the collection of probability distributions parameterized 

by P, v = J 

to which more ensembles can be added. For instance Gibbs introduced 



(c) the grand canonical ensemble which is the collection of probability distributions pa- 
rameterized by /?, A and defined over the space T gc = U^ =0 JF(A^) 

u 9C (dPdO) - 1 (3\n-0H(p,q) dVdQ (r. o \ 

t*f,,xWd<4) - ZM ^ v) e m hdN (5-3) 

Hence there are several different models of Thermodynamics: a key test for accepting them 
as real microscopic descriptions of macroscopic Thermodynamics is that 
(1) a correspondence between the macroscopic states of thermodynamic equilibrium and 
the elements of a collection of probability distributions on phase space can be established 
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by identifying on one side macroscopic thermodynamic states with given values of the 
thermodynamic functions and, on the other side, probability distributions attributing the 
same average values to the corresponding microscopic observables (i.e. whose averages 
have the interpretation of thermodynamic functions). 

(2) once the correct correspondence between the elements of the different ensembles is 
established, i.e. once the pairs (u,v), (/3,v), (/?, A) are so related to produce the same 

values for the averages U, V, kBT d = j3~ x , p|<9f2| of 



(^an(qi) is a delta function pinning qi to the surface dfl) then the averages of all physi- 
cally interesting observables should coincide at least in the thermodynamic limit, Q — > oo. 
In this way the elements ji of the considered collection of probability distributions can be 
identified with the states of macroscopic equilibrium of the system. The /it's depend on pa- 
rameters and therefore they form an ensemble: each of them corresponds to a macroscopic 
equilibrium state whose thermodynamic functions are appropriate averages of microscopic 
observables and therefore are functions of the parameters identifying \i. 

Remark: The word ensemble is often used to indicate the individual probability dis- 
tributions of what is called here ensemble. The meaning used here seems closer to the 
original sense in the 1884 paper of Boltzmann (in other words often by "ensemble" one 
means that collection of the phase space points on which a given probability distribution 
is considered, and this does not seem to be the original sense). 

For instance in the case of the microcanonical distributions this means interpreting 
energy, volume, temperature and pressure of the equilibrium state with specific energy 
u and specific volume v as proportional, through appropriate universal proportionality 
constants, to the integrals with respect to jU^(dPdQ) of the mechanical quantities in 
(5.4); and the averages of other thermodynamic observables in the state with specific 
energy u and specific volume v should be given by their integrals with respect to 

Likewise one can interpret energy, volume, temperature and pressure of the equilib- 
rium state with specific energy it and specific volume v as the averages of the mechanical 
quantities (5.4) with respect to the canonical distribution ^ v (dP dQ) which has average 
specific energy precisely it; and the averages of other thermodynamic observables in the 
state with specific energy and volume u and v are given by their integrals with respect to 
Hp v . And a similar definition can be given for the description of thermodynamic equilibria 
via the grand canonical distributions. 

Bibliography: [Gi81], [Ga99]. 

6. Equivalence of ensembles 

Boltzmann proved that, computing averages via the microcanonical or canonical 
distributions, the essential property (3.1) was satisfied when changes in their parameters 
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(i.e. u,v or fi,v respectively) induced changes du.dv on energy and volume. He also 
proved that the function s, whose existence is implied by (3.1), was the same function once 
expressed as a function of u, v (or of any pair of thermodynamic parameters, e.g. of T,v 
or of p, u). A close examination of Boltzmann's proof shows that the (3.1) holds exactly in 
the canonical ensemble and up to corrections tending to as O — > oo in the micro canonical 
ensemble. Identity of thermodynamic functions evaluated in the two ensembles holds, as 
a consequence, up to corrections of this order. And Gibbs added that the same held for 
the grand canonical ensemble. 

Of course not every collection of stationary probability distributions on phase space 
would provide a model for Thermodynamics: Boltzmann called orthodic the collections of 
stationary distributions which generated models of Thermodynamics through the above 
mentioned identification of its elements with macroscopic equilibrium states. The micro- 
canonical, canonical and the later grand canonical ensembles are the chief examples of 
orthodic ensembles: which Boltzmann and Gibbs proved to be not only orthodic but to 
generate the same thermodynamic functions, i.e. to generate the same Thermodynamics 

This freed from the analysis of the truth of the doubtful ergodic hypothesis (still 
unproved in any generality) or of the monocyclicity (manifestly false if understood literally 
rather than regarding phase space as consisting of finitely many small, discrete, cells), and 
allowed Gibbs to formulate the problem of Statistical Mechanics of equilibrium as 

Problem: study the properties of the collection of probability distributions constituting (any) 
one of the above ensembles. 

However by no means the three ensembles just introduces exhaust the class of orthodic 
ensembles producing the same models of Thermodynamics in the limit of infinitely large 
systems. The wealth of ensembles with the orthodicity property, hence leading to equiv- 
alent mechanical models of Thermodynamics, can be naturally interpreted in connection 
with the phase transitions phenomenon, see Sect. 9. 

Clearly the quoted results do not "prove" that thermodynamic equilibria "are" de- 
scribed by the microcanonical or canonical or grand canonical ensembles. However they 
certainly show that for most systems, independently of the number of degrees of freedom, 
one can define quite unambiguously a mechanical model of Thermodynamics establishing 
parameter free, system independent, physically important relations between Thermody- 
namic quantities (e.g. d u p^ v \ = d v T( l^ , from (3.1)). 

The ergodic hypothesis which was at the root of the mechanical theorems on heat and 
entropy cannot be taken as a justification of their validity. Naively one would expect that 
the time scale necessary to see an equilibrium attained, called recurrence time scale, would 
have to be at least the time that a phase space point takes to visit all possible microscopic 
states of given energy: hence an explanation of why the necessarily enormous size of the 
recurrence time is not a problem becomes necessary.. 

In fact the recurrence time can be estimated once the phase space is regarded as 
discrete: for the purpose of countering mounting criticism Boltzmann assumed that mo- 
mentum was discretized in units of (2m/csT)2 (i.e. the average momentum size) and 
space was discretized in units of p~3 (i.e. the average spacing), implying a volume of 

cells h 3N with h d = (2m/csT)^; then he calculated that, even with such a gross dis- 
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cretization, a cell representing a microscopic state of 1cm 3 of hydrogen at normal con- 
dition would require a time (called "recurrence time") of the order of ~ 10 10 times 
the age of the Universe (!) to visit the entire energy surface. [In fact the phase space 
volume is r = (p~ 3 N{2mkBT)^) N = h 3N and the number of cells of volume h 3N is 
T/(N\h 3N ) ~ e 3N : and the time to visit all will be e 3N To with To a typical atomic unit, 
e.g. 10~ 12 sec: but N = 10 19 ]. In this sense the statement boldly made by the young 
Boltzmann that "aperiodic motions can be regarded as periodic with infinite period" was 
even made quantitative. 

The time is clearly so long to be irrelevant for all purposes: nevertheless the correctness 
of the microscopic theory of Thermodynamics can still rely on microscopic dynamics once 
it is understood, as stressed by Boltzmann, that the reason why we observe approach 
to equilibrium, and equilibrium itself, over "human" time scales, far shorter than the 
recurrence times, is due to the property that on most of the energy surface the (very few) 
observables whose averages yield macroscopic thermodynamic functions (namely pressure, 
temperature, energy . . .) assume the same value even if N is only very moderately large (of 
the order of 10 3 rather than 10 19 ). This implies that this value coincides with the average 
and therefore satisfies the heat theorem without any contradiction with the length of the 
recurrence time. The latter rather concerns the time needed to the generic observable 
to thermalize, i.e. to reach its time average: the generic observable will indeed take a 
very long time to "thermalize" but no one will ever notice because the generic observable 
(e.g. the position of a pre-identified particle) is not relevant for Thermodynamics. 

The word "proof" used so far in this paper is not in the mathematical sense: the 
relevance of a mathematically rigorous analysis was widely realized only around the 1960's 
at the same time when the first numerical studies of the thermodynamic functions became 
possible and rigorous results became needed to check the correctness of various numerical 
simulations. 



Bibliography: [B066], [Bo84], [Ga99]. 
7. Thermodynamic limit 

Adopting Gibbs axiomatic point of view it is interesting to see the path to be followed 
to achieve an equivalence proof of three ensembles introduced in Sect 4. 

A preliminary step is to consider, given a cubic box Q of volume V = L d , the nor- 
malization factors Z3 C (I3, A, V), Z c (f3, N, V),Z mc (U, N, V) in (5.1), (5.2), (5.3), and to check 
that the following thermodynamic limits exist: 

(3p gc ((3, A) = f lim I log Z gc (P, A, V) 

V -^00 v 

-0/cOM = f lim i-logZ c (/3,A,U) (71) 
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where the density p = v _1 = y is used instead of v for later reference. The normalization 
factors play an important role because they have simple thermodynamic interpretation, see 
Sect. 8: they are called grand canonical, canonical and microcanonical partition functions, 
respectively. 

Not surprisingly assumptions on the interparticle potential y>(q — q') are necessary to 
achieve an existence proof of the limits in (7.1). The assumptions on <p are not only quite 
general but also have a clear physical meaning. They are 

(1) stability: i.e. existence of a constant B > such that J2i<j 'Pio.i ~ Qj) > —BN for all 
N >0,q 1 ,...,q N e R d , and 

(2) temperedness: i.e. existence of constants Eq,R > such that |</?(q — q')| < £?|q — 
q /|-d-e f or | q - q '| > iJ. 

The assumptions are satisfied by essentially all microscopic interactions with the no- 
table exceptions of the gravitational and Coulombic interactions, which require a separate 
treatment (and lead to somewhat different results on the Thermodynamic behavior). 

For instance the (1), (2) are satisfied if (f(q) is +oo for |q| < r and smooth for 
|q| > ro, for some ro > 0, and furthermore <£>(q) > -Bo|q| _< ~ d+e °' ) if < |q| < R, while for 
jqj > R it is |<p(q)| < Bi |q| _ ( d+e °) with B , Bi, So > 0, R > r suitably chosen. Briefly 
<f is fast diverging at contact and fast approaching at large distance. This is called a 
(generalized) Lennard-Jones potential. If ro > the <p is called a hard core potential. If 
Bi = the potential is said to have finite range. See Appendix Al for physical implications 
of violations of the above stability and temperedness properties. However in the following 
it will necessary, both for simplicity and to contain the length of the exposition, to restrict 
consideration to the case Bi = 0, i.e. to 

V?(q) > S |qr (d+£o) , r <\q\<R and \(p(q)\ = 0, |q| > R (7.2) 

unless explicitly stated. 

Assuming (1) and (2) the existence of the limits in (7.1) can be mathematically proved: 
in Appendix A2 the proof of the first is analyzed to provide the simplest example of the 
technique. A remarkable property of the functions f3p gc ((3, \), —(3pf c ((3, p), ps mc (u, p) is 
that they are convex functions: hence they are continuous in the interior of their do- 
mains of definition and at one variable fixed are, with at most countably many exceptions, 
differentiable in the other. 

In the case of a potential without hard core (p max = oo) the —pf c ((3, p) can be checked 
to tend to slower than p as p — > and to — oo faster than — p as p — > oo (essentially 
proportionally to — p log p in both cases). Likewise in the same case s mc (u, p) can be shown 
to tend to slower than and to — oo faster than — u as u — > oo. 

The latter asymptotic properties can be exploited to derive from the relations between the 
partition functions in (7.1) 

^ c (/3, A, V) = En=o e? XN Z c (P, N, V) and 

Z C ((3,N,V)= f^° B e-P u Z rnc (U,N,V)dU ^ ' ' 

and from the mentioned convexity the consequences 



2//ebbraio/2008; 14:37 



10 



PPmc(P, A) =max(/3At;- 1 - pv' 1 UP^v' 1 )) 

-i -i ( 7 - 4 ) 

-PMP,v ) =max(-/3w + k B s mc (u,v )) 

and that the maxima are attained in points, or intervals, internal to the intervals of 
definition. Let v gc ,u c be points where the maxima are respectively attained in (7.4). 

Remark that the quantity e za\ff\v)^ nas the interpretation of probability of a 
density v~ l = N/V evaluated in the grand canonical distribution. It follows that if the 
maximum in the first of (7.4) is strict, i.e. it is reached at a single point, the values of 
v~ x in closed intervals not containing the maximum point v~} have a probability behaving 
as < e~ cV ,c > 0, as V — > oo, compared to the probability of v~ x, s in any interval con- 
taining iC, 1 . Hence v gc has the interpretation of average value of v in the grand canonical 
distribution, in the limit V — > oo. 

Likewise the interpretation of ^y^ N,V ^ as probability in the canonical dis- 
tribution of an energy density u shows that if the maximum in the second of (7.4) is strict 
the values of u in closed intervals not containing the maximum point u c have a proba- 
bility behaving as < e~ cV ,c > 0, as V — > oo, compared to the probability of u's in any 
interval containing u c . Hence in the limit O — > oo the average value of w in the canonical 
distribution is u c . 

If the maxima are strict the (7.4) also establishes a relation between the grand canon- 
ical density, the canonical free energy and the grand canonical parameter A, or between 
the canonical energy, the microcanonical entropy and the canonical parameter (3: 

A = d v -i(v~cf c (P,v~c)), k B /3 = duSmducV' 1 ) (7.5) 

where convexity and strictness of the maxima imply the derivatives existence. 

Remark: Therefore in the equivalence between canonical and microcanonical ensembles the 
canonical distribution with parameters (/3, v) should correspond with the microcanonical 
with parameters (u c ,v). And the grand canonical distribution with parameters (/?, A) 
should correspond with the canonical with parameters (/3, v gc ) . 

Bibliography: [Ru69], [Ga99]. 

8. Physical interpretation of thermodynamic functions 

The existence of the limits (7.1) implies several properties of interest. The first is 
the possibility of finding the physical meaning of the functions p gc , f c , s mc and of the 
parameters A, (3 

Note first that, for all V the grand canonical average (K) ^ x is ^/3~ 1 (N) l3 x so that 
P~ x is proportional to the temperature T gc = T(/3, A) in the gand canonical distribution: 

= kBT((3, A). Proceeding heuristically the physical meaning of p((3, A) and of A can 
be found by the following remarks. 
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Consider the microcanonical distribution and denote by J* the integral over 
(P, Q) extended to the domain of the (P, Q) such that H(P, Q) = U and, at the same 
time, qi G dV where dV is an infinitesimal volume surrounding the region O. Then by the 
microscopic definition of the pressure p, see Sect. 1, it is 



HV= N (* 2p\ dPdQ _ 2 r dP dQ 

P Z(U,N,V)J 3 2mN\h dN - 3Z(U,N,V) J [ ' N\h dN K ' 

where 5 = 5(H(P, Q) — £/"). The r.h.s. of (8.1) can be compared with dyZ z = 
Z( ^ )V) /* to S ive 9vZ z dV = N iqWr = PP dV because (K)*, which denotes the 

average J* K/ J* 1, should be essentially the same as the microcanonical average (K) mc 
(i.e. insensitive to the fact that one particle is constrained to the volume dV) if N is large. 
In the limit V — > oo, jj = v the latter remark together with the second of (7.5) yields 

k^dySradu.v' 1 ) = (3 p(u, v) , k^ 1 d u s mc (u, v) = (3 (8.2) 

respectively. Note that p > and it is not increasing in v because s mc (p) is concave as a 
function of v = p~ x (in fact by the remark following (7.2) ps mc (u, p) is convex in p and, 
in general, if pg(p) is convex in p then giv -1 ) is, always, concave in v = p _1 ). 

Hence ds mc (u, v) = ^H+fffe so that taking into account the physical meaning of p, T (as 
pressure and temperature, see Sec. 2), s mc is, in Thermodynamics, the entropy. Therefore, 
see the second of (7.4), —f3f c ([3, p) = —/3u c + k^ 1 s mc (u c , p) becomes 

fc(P,p) = u c -T c s mc (u c ,p), df c = -pdv - s mc dT (8.3) 

and since u c has the interpretation, as mentioned in Sect. 7, of average energy in the 
canonical distribution p c p v it follows that f c has the thermodynamic interpretation of free 
energy (once compared with the free energy definition F = U — TS in Thermodynamics) . 

By (7.5), (8.3) A = d v -i (v~^f c ((3, v~})) = u c - T c s mc +pv gc and v gc has the meaning 
of specific volume v. Hence, after comparison with the chemical potential definition AV^ = 
U — TS + pV in Thermodynamics, it follows that the thermodynamic interpretation of 
A is the chemical potential and, see (7. 4), (7. 5), the grand canonical relation j3p(j3,X) = 
f3\v~c — f3v~c(—f3u c + k^ 1 s mc (u c , i> -1 )) shows that p(/3,\) = p implying that p(/3, A) is 
the pressure expressed, however, as a function of temperature and chemical potential. 

To go beyond the heuristic derivations above it should be remarked that convexity and 
the property that the maxima in (7. 4), (7. 5) are reached in the interior of the intervals of 
variability of v or u are sufficient to turn the above arguments into rigorous mathematical 
deductions: this means that given (8.2) as definitions of p(u,v),f3(u,v) the second of 
(8.3) follows as well as p(j3, A) = p(u v , iC. 1 ). But the values v gc and u c in (7.4) are 
not necessarily unique: convex functions can contain horizontal segments and therefore 
the general conclusion is that the maxima may possibly be attained in intervals. Hence 
instead of a single v gc there might be a whole interval where the r.h.s. of (7.4) 

reaches the maximum and instead of a single u c there might be a whole interval [it_,it+] 
where the r.h.s. of (7.5) reaches the maximum. 
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Convexity implies that the values of A or of (3 for which the maxima in (7.4) or (7.5) are 
attained in intervals rather than in single points are rare {i.e. at most denumerably many): 
the interpretation is, in such cases, that the thermodynamic functions show discontinuities: 
and the corresponding phenomena are called phase transitions, see Sect. 9. 

Bibliography: [Ru69], [Ga99]. 

9. Phase transitions and boundary conditions 

The analysis in Sect. 7, 8 of the relations between elements of ensembles of distributions 
describing macroscopic equilibrium states not only allows us to obtain mechanical models 
of Thermodynamics but it also shows that the models, for a given system, coincide at least 
as O — > oo. Furthermore the equivalence between the thermodynamic functions computed 
via corresponding distributions in different ensembles can be extended to a full equivalence 
of the distributions. 

If the maxima in (7.4) are attained in single points v gc or u c the equivalence should 
take place in the sense that a correspondence between /U^ C A , \i c ^ v , \i™ c v can be established 
so that given any local observable F(P, Q), defined as an observable depending on (P, Q) 
only through the p;,qi with G A where A C O is a finite region, has the same average 
with respect to corresponding distributions in the limit O — > oo. 

The correspondence is established by considering (A, (3) < — ► (/?, v gc ) < — ► (u mc , v) where 
v gc is where the maximum in (7.4) is attained, u mc = u c is where the maximum in (7.5) is 
attained and v gc = v, (c.f. also (8. 2), (8. 3)). This means that the limits 

lim / F(P,Q)/j a (dP dQ) d = (F) = a- independent, a = gc,c,mc (9.1) 

coincide if the averages are evaluated by the distributions /«| C A , Hp Vc , H^ c , Vmc 

Exceptions to the (9.1) are possible: and are certainly likely to occur at values of u, v 
where the maxima in (7.4) or (7.5) are attained in intervals rather than in isolated points; 
but this does not exhaust, in general, the cases in which (9.1) may not hold. 

However no case in which (9.1) fails has to be regarded as an exception. It rather 
signals that an interesting and important phenomenon occurs. To understand it properly 
it is necessary to realize that the grand canonical, canonical and microcanonical families 
of probability distributions are by far not the only ensembles of probability distributions 
whose elements can be considered to generate models of Thermodynamics, i.e. which are 
orthodic in the sense of Sect. 6. More general families of orthodic statistical ensembles of 
probability distributions can be very easily conceived. In particular 

Definition: consider the grand canonical, canonical and microcanonical distributions as- 
sociated with an energy function in which the potential energy contains, besides the inter- 
action $ between particles located inside the container, also the interaction energy &in }0 ut 
between particles inside the container and external particles, identical to the ones in the 
container but not allowed to move and fixed in positions such that in every unit cube A 
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external to Q there is a finite number of them bounded independently of A. Such configu- 
rations of external particles will be called "boundary conditions of fixed external particles". 

The thermodynamic limit with such boundary conditions is obtained by considering 
the grand canonical, canonical and microcanonical distributions constructed with potential 
energy function $ + Qi n ,out in containers O of increasing size taking care that, while the 
size increases, the fixed particles that would become internal to O are eliminated. The 
argument used in Sec. 7 to show that the three models of thermodynamics, considered there, 
did define the same thermodynamic functions can be repeated to reach the conclusion that 
the also the (infinitely many) "new" models of Thermodynamics in fact give rise to the 
same thermodynamic functions and averages of local observables. Furthermore the values 
of the limits corresponding to (7.1) can be computed using the new partition functions 
and coincide with the ones in (7.1) (i.e. are boundary conditions independent). 

However it may happen, and in general it really happens, for many models and for 
particular values of the state parameters, that the limits in (9.1) do not coincide with the 
analogous limits computed in the new ensembles: i. e. the averages of some local observables 
are unstable with respect to changes of fixed particles boundary conditions. 

There is a very natural interpretation of such apparent ambiguity of the various mod- 
els of Thermodynamics: namely at the values of the parameters that are selected to de- 
scribe the macroscopic states under consideration there may correspond different equilib- 
rium states with the same parameters. When the maximum in (7.4) is reached on an 
interval of densities one should not think of any failure of the microscopic models for Ther- 
modynamics: rather one has to think that there are several states possible with the same 
(3, A and that they can be identified with the probability distributions obtained by form- 
ing the grand canonical, canonical or microcanonical distributions with different kinds of 
boundary conditions. 

For instance a boundary condition with high density may produce an equilibrium state 
with parameters /?, A which has high density, i.e. the density v^ 1 at the right extreme of 
the interval in which the maximum in (7.4) is attained, while using a low density boundary 
condition the limit in (9.1) may describe the averages taken in a state with density vZ 1 at 
the left extreme of the interval or, perhaps, with a density intermediate between the two 
extremes. Therefore the following definition emerges. 

Definition: If the grand canonical distributions with parameters (/?, A) and different 
choices of fixed external particles boundary conditions generate for some local observable 
F average values which are different by more than a quantity 5 > for all large enough 
volumes Q then one says that the system has a phase transition at (/?, A) . This implies that 
the limits in (9.1), when existing, will depend on the boundary condition and their values 
will represent averages of the observables in "different phases" . A corresponding definition 
is given in the case of the canonical and microcanonical distributions when, given (/?, v) or 
(u, v), the limit in (9.1) depends on the boundary conditions for some F . 

Remarks: (1) The idea is that by fixing one of the thermodynamic ensembles and by vary- 
ing the boundary conditions one can realize all possible states of equilibrium of the system 
that can exist with the given values of the parameters determining the state in the chosen 
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ensemble (i.e. ((3, A), (/?, v) or (tt, f ) in the grand canonical, canonical or micro canonical 
cases respectively). 

(2) The impression that in order to define a phase transition the thermodynamic limit is 
necessary is incorrect: the definition does not require considering the limit O — > oo. The 
phenomenon is that by changing boundary conditions the average of a local observable can 
change at least by amounts independent of the system size. Hence occurrence of a phase 
transition is perfectly observable in finite volume: it suffices to check that by changing 
boundary conditions the average of some observable changes by an amount whose minimal 
size is volume independent. It is a manifestation of an instability of the averages with 
respect to boundary conditions changes: an instability which does not fade away when 
the boundary recedes to infinity, i.e. boundary perturbations produce bulk effects and at 
a phase transitions the averages of the local observable, if existing at all, will exhibit a 
nontrivial boundary conditions dependence. This is also called "long range order" . 

(3) It is possible to show that when this happens then some thermodynamic function whose 
value is boundary condition independent (like the free energy in the canonical distribu- 
tions) has discontinuous derivatives in terms of the parameters of the ensemble defining 
them. This is in fact one of the frequently used alternative definitions of phase transitions: 
the latter two natural definitions of first order phase transition are equivalent. However 
it is very difficult to prove that a given system shows a phase transition. For instance 
existence of a liquid-gas phase transition is still an open problem in systems of the type 
considered until Sect. 15. 

(4) A remarkable unification of the theory of the equilibrium ensembles emerges: all dis- 
tributions of any ensemble describe equilibrium states. If a boundary condition is fixed 
once and for all then some equilibrium states might fail to be described by an element of 
an ensemble. However if all boundary conditions are allowed then all equilibrium states 
should be realizable in a given ensemble by varying the boundary conditions. 

(5) The analysis leads us to consider as completely equivalent without exceptions grand 
canonical, canonical or microcanonical ensembles enlarged by adding to them the distri- 
butions with potential energy augmented by the interaction with fixed external particles. 

(6) The above picture is really proved only for special classes of models (typically in models 
in which particles are constrained to occupy points of a lattice and in systems with hard 
core interactions, ro > in (7.2)) but it is believed to be correct in general: at least it is 
consistent with all what is so far known in classical statistical mechanics. The difficulty 
is that conceivably one might even need boundary conditions more complicated than the 
fixed particles boundary conditions (like putting different particles outside interaction with 
the system with an arbitrary potential, rather than via cp). 

The discussion of the equivalence of the ensembles and the question of boundary 
conditions importance has already imposed the consideration of several limits as O — > oo. 
Occasionally it will again come up. For conciseness purposes it is useful to set up a formal 
definition of equilibrium states of an "infinite volume" system: although infinite volume is 
an idealization void of physical reality it is nevertheless useful to define such states because 
certain notions, like that of pure state can be sharply defined, with few words and avoiding 
wide circumvolutions, in terms of them. Therefore let: 

Definition: An infinite volume state with parameters ((3,\), or ((3,v) is a collection of 
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average values F — > (F) obtained, respectively, as limits of finite volume averages (F)^ 
defined from canonical, microcanonical or grand canonical distributions in Q n with fixed 
parameters A), (f3,v), or (u,v) and with general boundary condition of fixed external 
particles, on sequences fi n — > oo for which such limits exist simultaneously for all local 
observables F . 

Having set the definition of infinite volume state consider a local observable G(X) and 
let r^G(X) = G(X + £), £ G K d , with X + £ denoting the configuration X in which all 
particles are translated by £: then an infinite volume state is a pure state if for any pair of 
local observables F, G it is 

(F n G)-(F)( n G) 1 ^0 (9.2) 

which is called a cluster property of the pair F, G. 

The result alluded to in remark (6) is that at least in the case of hard core systems (or 
of the simple lattice systems discussed in Sect. 15) the infinite volume equilibrium states 
in the above sense exhaust at least the totality of the infinite volume pure states. And 
furthermore the other states that can be obtained in the same way are convex combinations 
of the pure states, i.e. they are "statistical mixtures" of pure phases. Note that (r^G) 
cannot be replaced, in general, by (G) because not all infinite volume states are necessarily 
translation invariant and in simple cases (crystals) it is even possible that no translations 
invariant state is a pure state. 

Remarks: (1) This means that, in the latter models, generalizing the boundary conditions, 
e.g. considering external particles not identical to the ones in the system, or using periodic 
or partially periodic boundary conditions or the widely used alternative device of intro- 
ducing a small auxiliary potential and first taking the infinite volume states in presence 
of it and then letting the potential vanish does not enlarge further the set of states (but 
may sometimes be useful: an example of a study of a phase transition by using the latter 
method of small fields will be given in Sect. 13). 

(2) If x is the indicator function of a local event it will make sense to consider the proba- 
bility of occurrence of the event in a infinite volume state defining it as (x). In particular 
the probability density for finding p particles in xi, X2, . . . , x p , called the p-point correla- 
tion function, will thus be defined in an infinite volume state. For instance if the state 
is obtained as a limit of canonical states (-)n„ with parameters /?, p, p = ^ ZL , in a se- 
quence of containers fi n , then p(x) = lim n (X^^Ti <K X ~~ t l7')) n > an< ^ /°( x i5 x 2, • • • , x p ) = 
lim n (X^" i rij=i ^( x i — CLij))^ 7 where the sum is over the ordered p-ples (ji, . . . ,j p ). 
Thus the pair correlation p(q, q') and its possible cluster property are 



_ / nw e-^^^>-"^n- a )d qi ...dq jVn _ 2 
Mq ' qJ » (N n - 2)1 ZS(P,p,V n ) (9.3) 

p(q,(q , + 0)-p(q)p(q , + Oi=^o 
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where Zq = J e ^ u ^dQ is the "configurational" partition function. 

Bibliography: [Ru69], [D068], [LR69], [Ga99]. 

10. Virial theorem and atomic dimensions 



For a long time it has been doubted that "just changing boundary conditions" could 
produce such dramatic changes as macroscopically different states (i.e. phase transitions in 
the sense of the definition in Sect. 9). The first evidence that by taking the thermodynamic 
limit very regular analytic functions like iV -1 log Z c (f3, N, V) as a function of v = -j^ 
could develop in the limit O — > 00 singularities like discontinuous derivatives (correspond- 
ing to the maximum in (7.4) being reached on a plateau and to a consequent existence of 
several pure phases) arose in the van der Waals ' theory of liquid gas transition. 

Consider a real gas with N identical particles with mass m in a container O with 
volume V. Let the force acting on the i-th particle be ff, multiplying both sides of the 
equations of motion mq^ = f* by — ^q* and summing over % it follows — \ ^=1 m( \i ' = 

— \Y^=i c Li ' ^% d — ! |C(q) an d the quantity C(q) defines the virial of the forces in the 
configuration q; note that C(q) is not translation invariant because of the presence of the 
forces due to the walls. 

Writing the force fj as a sum of the internal and of the external forces (due to the 
walls) the virial C can be expressed naturally as sum of the virial C int of the internal forces 
(translation invariant) and of the virial C ext of the external forces. 

By dividing both sides of the definition of the virial by r and integrating over the time 
interval [0, r] one finds in the limit r — > +00, i.e. up to quantities relatively infinitesimal 
as t — > 00, that (K) = |(C), and (C ext ) = 3pV where p is the pressure and V the volume. 
Hence (K) = \pV + \{C int ) or 

Equation (10.1) is the Clausius' virial theorem: in the case of no internal forces it yields 
(3pv = 1, the ideal gas equation. 

The internal virial Ci nt can be written, if fj->i = —d cli (p(qi — q^), as Ci nt = — X^ili 
fj-u q, = Yli<j dqifiQi ~ Qj) ' (q* — Qj) which shows that the contribution to 
the virial by the internal repulsive forces is negative while that of the attractive forces 
is positive. The average of Ci n t can be computed by the canonical distribution, which is 
convenient for the purpose, van der Waals first used the virial theorem to perform an 
actual computation of the corrections to the perfect gas laws. Simply neglect the third 
order term in the density and use the approximation p(qi,q2) = p 2 e~ f3<p ( cll ~ cl2 ' ) for the 
pair correlation function, (9.3), then 

l(C 1 nt) = V^p 2 I((3)+VO(p 3 ) (10.2) 

where I((3) = \ J ( e -^ q ) - l) d 3 q and the equation of state (10.1) becomes pv + ffi + 
0( v -i)=[3-\ 



2//ebbraio/2008; 14:37 



17 



For the purpose of illustration the calculation of I can be performed approximately at 
"high temperature" (j3 small) in the case <p(r) = 4e((^-) 12 — (^r) 6 ) (the classical Lennard- 
Jones potential), e, > 0. The result is I = —(b— /3a), b = 4i>o, « = vo = -^(^f) 3 , : 

hence pv + f - £ = \ + O(^), (p + £) v = (1 + J) J = ^± + O(^) or: 

(p+4)(^- 6 )/ 3 = i + ^" 2 )' ( lo - 3 ) 

ir 

which gives the equation of state for j3e <C 1. Equation (10.3) can be compared with the 
well-known empirical van der Waals equation of state: 

(3(p + a/v 2 )(v-b) = 1 or (p + An 2 /V 2 )(V - nB) = nRT (10.4) 

where, if N A is Avogadro's number, A = aN\, B = bN A , R = k B N A , n = N/N A . It 
shows the possibility of accessing the microscopic parameters e and tq of the potential tp 
via measurements detecting deviations from the Boyle-Mariotte law f3pv = 1 of the rarefied 
gases: e = 3a/8b = 3A/8BN A , r = (3b/2n) 1 / 3 = (3S/2tt^) 1 /3. 

As a final comment it is worth stressing that the virial theorem gives in principle 
the exact corrections to the equation of state, in a rather direct and simple form, as time 
averages of the virial of the internal forces. Since the virial of the internal forces is easy to 
calculate from the positions of the particles as a function of time the theorem provides a 
method for computing the equation of state in numerical simulations. In fact this idea has 
been exploited in many numerical experiments, in which the (10.1) plays a key role. 



Bibliography: [Ga99]. 



11. van der Waals theory 



Equation (10.4) is empirically used beyond its validity region (small density and small 
P) by regarding A, B as phenomenological parameters to be experimentally determined by 
measuring them near generic values of p, V, T. The measured values of A, B do not "usually 
vary too much" as functions of v,T and, apart from this small variability, the predictions 
of (10.4) have reasonably agreed with experience until, as experimental precision increased 
over the years, serious inadequacies eventually emerged. 



P 




(1) 



vi 



V 



The van der Waals equation of state at a temperature T < T c where the pressure is not monotonic. The 
horizontal line illustrates the "Maxwell rule" . 
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Certain consequences of (10.4) are appealing: e.g. Fig.l shows that it does not give a 
p monotonic non increasing in v if the temperature is small enough. A critical temperature 
can be defined as the largest value T c of the temperature below which the graph of p as a 
function of v is not monotonic decreasing; the critical volume V c is the value of v at the 
horizontal inflection point occurring for T = T c . 

For T < T c the van der Waals interpretation of the equation of state is that the 
function p(v) may describe metastable states while the actual equilibrium states would 
follow an equation with a monotonic dependence on v and p(v) becoming horizontal in the 
coexistence region of specific volumes. The precise value of p where to draw the plateau 
(see Fig.l) would then be fixed by experiment or theoretically predicted via the simple 
rule that the plateau associated with the represented isotherm is drawn at a height such 
that the area of the two cycles in the resulting loop are equal. 

This is Maxwell's rule: obtained by assuming that the isotherm curve joining the 
extreme points of the plateau and the plateau itself defines a cycle, see Fig.l, representing a 
sequence of possible macroscopic equilibrium states (the ones corresponding to the plateau) 
or states with extremely long time of stability ( "metastable" ) represented by the curved 
part. This would be an isothermal Carnot cycle which therefore could not produce work: 
since the work produced in the cycle (i.e. § pdv) is the signed area enclosed by the cycle the 
rule just means that the area is zero. The argument is doubtful at least because it is not 
clear that the intermediate states with p increasing with v could be realized experimentally 
or could even be theoretically possible. 

A striking prediction of (10.4), taken literally, is that the gas undergoes a "gas-liquid" 
phase transition with a critical point at a temperature T c , volume v c and pressure p c that 
can be computed via (10.4) and are given by RT C = 8A/27B, V c = 3S, (n = 1). 

At the same time this is interesting as it shows that there are simple relations between 
the critical parameters and the microscopic interaction constants (s ~ ksTc and ro — 
(Vc/Na)) 1 / 3 : e = 81/c B T c /64, r = (y c /27rA^A) 1/3 if a classical Lennard-Jones potential 
(i.e. ip = 4£((y^j) 12 — (]^j) 6 )? see Sect. 10) is used for the interaction potential <p. 

However (10.4) cannot be accepted acritically not only because of the approximations 
(essentially the neglecting of 0(v~ x ) in the equation of state), but mainly because, as 
remarked above, for T < T c the function p is no longer monotonic in v as it must be, see 
comment following (8.2). 

The van der Waals' equation, refined and complemented by Maxwell's rule, predicts 
the following behavior: 



which are in sharp contrast with the experimental data gathered in the twentieth century. 
For the simplest substances one finds instead 8 = 5, (3 = 1/3. 

Finally blind faith in the equation of state (10.4) is untenable, last but not least, also 
because nothing in the analysis would change if the space dimension was d = 2 or d = 1: 
but in the last case, d = 1, it is easily proved that the system, if the interaction decays 
rapidly at infinity, does not undergo phase transitions, see Sect. 12. 



9 



-p c )^(v-v c ) s S = 3,T = T C 

vi) oc (T c -Tf (3 = 1/2, for T -> T~ 



(11.1) 
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In fact it is now understood that van der Waals' equation represents rigorously only a 
limiting situation, in which particles have a hard core interaction (or a strongly repulsive 
one at close distance) and a further smooth interaction ip with very long range. More 
precisely suppose that the part of the potential outside a hard core radius ro > is 
attractive (i.e. non negative) and has the form 7 d ^i(7 _1 |q|) < and call Po(v) the ((3- 
independent) product of (3 times the pressure of the hard core system without any attractive 
tail(Po(v) is not explicitly known except if d = 1 in which case it is Po(v)(v—b) = 1, b = ro), 
and let a = — \ J| q | >ro | v?i (q) | c?q. If p(j3,v;^) is the pressure when 7 > then it can be 
proved 

Pp(p,v)= f lim3p(P,v n ) = \-P± + Po (vj\ (11.2) 
7^0 



2 

V J Maxwell rule 



where the subscript means that the graph of p((3, v) as a function of v is obtained from the 
function in square bracket by applying to it Maxwell's rule, described above in the case 
of the van der Waals' equation. The (11.2) reduces exactly to the van der Waals equation 
in dimension d = 1 and for d > 1 it leads to an equations with identical critical behavior 
(even though Pq(v) cannot be explicitly computed). 



Bibliography: [LP79], [Ga99]. 



12. Absence of Phase Transitions: d = 1 



One of the most quoted "no go" theorems in Statistical Mechanics is that 1-dimensio- 
nal systems of particles interacting via short range forces do not exhibit phase transitions 
(c.f. Sect. 13) unless the somewhat unphysical situation of having zero absolute tempera- 
ture is considered. This is particularly easy to check in the case of "nearest neighbor hard 
core interactions": call ro the hard core size, so that the interaction potential <p(r) = +00 
if r < ro, and suppose also that <p(r) = if / > 2ro- In this case the thermodynamic 
functions can be exactly computed and checked to be analytic: hence the equation of state 
cannot have any phase transition plateau. This is a special case of van Hove's theorem 
establishing smoothness of the equation of state for interactions extending beyond the 
nearest neighbor and rapidly decreasing at infinity. 

If the definition of phase transition based on sensitivity of the thermodynamic limit 
to variations of boundary conditions is adopted then a more general, conceptually simple, 
argument can be given to show that in one-dimensional systems there cannot be any 
phase transition if the potential energy of mutual interaction between a configuration 
Q of particles to the left of a reference particle (located at the origin O, say) and a 
configuration Q' to the right of the particle (with Q U O U Q' compatible with the hard 
cores) is uniformly bounded below. Then a mathematical proof can be devised showing 
that boundary conditions influence disappears as the boundaries recede to infinity. One 
also says that no long-range order can be established in 1- dimensional, in the sense that 
one loses . 

The analysis fails if the space dimension is > 2: in this case, even if the interaction is 
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short ranged, the energy of interaction between two regions of space separated by a bound- 
ary is of the order of the boundary area. Hence one cannot bound above and below the 
probability of any two configurations in two half-spaces by the product of the probabilities 
of the two configurations, each computed as if the other was not there: because such bound 
would be proportional to the exponential of the surface of separation, which tends to oo 
when the surface grows large. This means that we cannot consider, at least not in general, 
the configurations in the two half spaces as independently distributed. 

Analytically a condition on the potential sufficient to imply that the energy between 
a configuration to the left and one to the right of the origin is bounded below, if the 
dimension d is d = 1, is simply expressed by J , r \ cp(r) \ dr < +00 for r' > tq. 

Therefore in order to have phase transitions in d = 1 a potential is needed that is "so 
long range" that it has a divergent first moment. It can be shown by counterexamples that 
if the latter condition fails there can be phase transitions even in d = 1 systems. 

The results just quoted apply also to discrete models like lattice gases or lattice spin 
models that will be considered later, see Sect. 15. 

Bibliography: [LL67], [Dy69], [Ga99], [GBG04]. 

13. Continuous symmetries: "no d = 2 crystal" theorem. 

A second case in which it is possible to rule out existence of phase transitions or 
at least of certain kinds of transitions arises when the system under analysis enjoys large 
symmetry. By symmetry it is meant a group of transformations acting on the configurations 
of a system and transforming each of them into a configuration which, at least for one 
boundary condition (e.g. periodic or open), has the same energy. 

A symmetry is said to be continuous if the group of transformations is a continu- 
ous group. For instance continuous systems have translational symmetry if considered in 
a container O with periodic boundary conditions. Systems with "too much symmetry" 
sometimes cannot show phase transitions. For instance the continuous translation sym- 
metry of a gas in a container O with periodic boundary conditions is sufficient to exclude 
the possibility of crystallization in dimension d = 2 . 

To discuss this, which is a prototype of a proof which can be used to infer absence 
of many transitions in systems with continuous symmetries, consider the translational 
symmetry and a potential satisfying, besides the usual (7.2) and with the symbols used in 
(7.2), the further property that |q| 2 |<9?-<^(q)| < £?|q|~( d+e °), with e G > 0, for some B holds 
for ro < |q| < R. This is a very mild extra requirement (and it allows for a hard core 
interaction). 

Consider an "ideal crystal" on a square lattice (for simplicity) of spacing a, exactly 
fitting in its container O of side L assumed with periodic boundary conditions: so that 
N = (^) d is the number of particles and a~ d is the density, which is supposed smaller than 
the close packing density if the interaction cp has a hard core. The probability distribution 
of the particles is rather trivial: /J = ^2 p Yl n S(q p ( n ) ~ an )^Ti the sum running over the 
permutations m — > p(m) of the sites m e O, m e Z d , < nii < La -1 . The density at q 
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is p(q) = En'K*! ~~ an ) = (X)j=i Qj)) an d its Fourier transform is proportional to 

p(k) = 7 ^(Ej e" lk -^) with k = with n G Z d . The p(k) has value 1 for all k of the 
form K = and ^0(max c= i, 2 | e lfc c a _ l| -2 ) otherwise. In presence of interaction it has 
to be expected that, in a crystal state, p(k) has peaks near the values K: but the value of 
p(k) can depend on the boundary conditions. 

Since the system is translation invariant a crystal state defined as a state with a 
distribution "close" to /Z, i.e. with p(q) with peaks at the ideal lattice points q = na, 
cannot be realized under periodic boundary conditions, even when the system state is 
crystalline. To realize such a state a symmetry breaking term in the interaction is needed. 

This can be done in several ways: e.g. by changing the boundary condition. Such a 
choice implies a discussion of how much the boundary conditions influence the positions of 
the peaks of p(k): for instance it is not obvious that a boundary condition will not generate 
a state with a period different from the one that a priori has been selected to disprove (a 
possibility which would imply a reciprocal lattice of K's different from the one considered 
to begin with). Therefore here the choice will be to imagine that an external weak force 
with potential eW(q) acts forcing a symmetry breaking that favors the occupation of 
regions around the points of the ideal lattice (which would mark the average positions of 
the particles in the crystal state that is being sought). The proof (Mermin's theorem) that 
no equilibrium state with particles distribution "close" to JI, i.e. with peaks in place of the 
delta functions (see below), is essentially reproduced below. 

Take W(q) = Enaesi — na ) where x(q) < is smooth and everywhere except in 
a small vicinity of the lattice points around which it decreases to some negative minimum 
keeping a rotation symmetry around them: the potential W is assumed invariant under 
the translations by the lattice steps. By the choice of the boundary condition and of eW 
the density p £ (q) will be periodic with period a so that p e (k) will, possibly, not have a 
vanishing limit as N — > oo only if k is a reciprocal vector K = -^-n. If the potential is 
ip + eW and if a crystal state in which particles have higher probability of being near the 
lattice points na exists, it should be expected that for e > small the system will be found 
in a state with Fourier transform p e (k) of the density satisfying, for some vector K ^ in 
the reciprocal lattice, 

lim lim |p e (K)| = r > 0, (13.1) 

e^O iV— >oo 

i.e. the requirement is that uniformly in e — > the Fourier transform of the density has 
a peak at some K ^ 0: note that if k is not in the reciprocal lattice p e (k) > 0, 

being bounded above by -^C^max^i^ \e lkja — 1|~ 2 ) because j^p e is periodic and has 
integral over q equal to 1. Hence excluding crystal existence will be identified with the 
impossibility of the (13.1). Other criteria can be imagined, e.g. considering crystals with 
a lattice different from simple cubic: they lead to the same result by following the same 
technique. Nevertheless it is not mathematically excluded (but unlikely) that with some 
weaker definition of crystal state existence could be possible even in 2 dimensions. 

The following inequalities hold under the present assumptions on the potential and in 
the canonical distribution with periodic boundary conditions and parameters ((3,p), p = 
a~ 3 in a box O with side multiple of a (so that iV = (La~ 1 ) d ) and potential of interaction 
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<p + eW. The further assumption that the lattice na is not a close packed lattice is 
(of course) necessary when the interaction potential has a hard core. Then for suitable 
B , B, Bi, B 2 > 0, independent of N, and e and for \k\ < ^ and for all O (z/K ^ 0) 



where the averages are in the canonical distribution (/?, p) with periodic boundary con- 
ditions and a symmetry breaking potential eW(q); 7(k) > is an (arbitrary) smooth a 
function vanishing for 2\k\ > 5 with 5 < — . See Appendix A3 for the derivation of (13.2). 

Multiplying both sides of the first in (13.2) by A^ _1 7(k) and summing over k the 
cristallinity condition in the form (13.1) implies that the r.h.s. is > Br 2 a d f\ K \ <s n 2^^ B2 ■ 

For d = 1, 2 the integral diverges, as or loge -1 respectively, implying |p e (K)| g ^ Q > r = 
0: the criterion of crystallinity (13.1) cannot be satisfied if d = 1,2. 

The above inequality is an example of a general class of inequalities called infrared 
inequalities stemming from another inequality called Bogoliubov's inequality, see Appendix 
A3, which lead to the proof that certain kinds of ordered phases cannot exist if the di- 
mension of the ambient space is d = 2 when a system in suitable boundary conditions 
(e.g. periodic) shows a continuous symmetry. The excluded phenomenon is, more pre- 
cisely, the non existence of equilibrium states exhibiting, in the thermodynamic limit, a 
symmetry lower than the continuous symmetry holding in periodic boundary conditions. 

In general existence of thermodynamic equilibrium states with symmetry lower than 
the symmetry enjoyed by the system in finite volume and under suitable boundary con- 
ditions is called a spontaneous symmetry breaking. It is yet another manifestation of 
instability with respect to changes in boundary conditions, hence its occurrence reveals a 
phase transition. There is a large class of systems for which an infrared inequality implies 
absence of spontaneous symmetry breaking: in most one or two dimensional systems a 
continuous symmetry cannot be spontaneously broken. 

The limitation to dimension d < 2 is a strong limitation to the generality of the 
applicability of infrared theorems to exclude phase transitions. More precisely systems 
can be divided into classes each of which has a "critical dimension" below which too much 
symmetry implies absence of phase transitions (or of certain kinds of phase transitions). 

It should be stressed that, at the critical dimension, the symmetry breaking is usually 
so weakly forbidden that one might need astronomically large containers to destroy small 
effects (due to boundary conditions or to very small fields) which break the symmetry: 
e.g. in the crystallization just discussed the Fourier transform peaks are only bounded by 
0( / —^ =): hence from a practical point of view it might still be possible to have some 

yloge- 1 

kind of order even in large containers. 




(13.2) 




Bibliography: [Me68], [Ho67], [Ru69]. 
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14. High Temperature and small density 



There is another class of systems in which no phase transitions take place. These 
are the systems with stable and tempered interactions ip, e.g. satisfying (7.2), in the high 
temperature and low density region. The property is obtained by showing that the equation 
of state is analytic in the variables (/?, p) near the origin (0, 0). 

A simple algorithm {Mayer's series) yields the coefficients of the virial series /3p(/3, p) = 
p + Ylh=2 c k(/3)p k - It has the drawback that the k-th order coefficient c/-(/3) is expressed 
as a sum of many terms (a number growing more than exponentially fast in the order k) 
and it is not so easy (but possible) to show combinatorially that their sum is bounded 
exponentially in k if j3 is small enough. A more common approach leads quickly to the 

def 

desired solution. Denoting <3>(qi, . . . , q n ) = J2i<j ^(Qi — Qj) consider the ("spatial or 
configurational" ) correlation functions defined, in the grand canonical distribution with 
parameters /?, A (and empty boundary conditions), by 

d ±f 1 y n+m f --£$(qi,-,q„,yi,-,y ro ) fol • • • d Yr 



0n (ai a ) = f - V/+ m / e -^(qi.-.qn.yi.-,ym) »yi---»^ ( U1 ) 



This is the probability density for finding particles with any momentum in the volume ele- 
ment dqi . . . dq n (irrespective of where other particles are), and z = e l3x (y / 2nm[3~ 1 h~ 2 ) d 
accounts for the integration over the momenta variables and is called the activity: it has 
the dimension of a density (c.f. (9.3)). 

Assuming that the potential has a hard core (for simplicity) of radius R the interaction 
energy $ qi (q 2 , . . . , q n ) of a particle in qi with any number of other particles in q 2 , . . . , q m 
with |qj — qj\ > R is bounded below by — B for some B > (related but not equal to the 
B in (7.2)). The functions pa will be regarded as a sequence of functions "of one, two,. . . 
particle positions": pa, = {pn(qi, • • • , <ln)}^Li vanishing for qj £ O. Then one checks that 



Pn(qi,---,q») = ^n,iXn(qi) + K p n {qi, . . . , q„) with 

Kp Q (q u . . . , q n ) = f e -^*-i (p fi (q 2 , . . . , q„) S n>1 + (14.2) 

oo p J J s 

+ ^ d Yl ...dy s jj (e _„ (qi _ yfe) _ l)pn(q25 ^ y ^ y ^ 

s=l^ n "" fe=l 

where 5 n ,i, $n>i are Kronecker deltas and Xsi(q) is the indicator function of O. The (14.2) 
is called the Kirkwood- Salzburg equation for the family of correlation functions in O. The 
kernel K of the equations is independent of O, but the domain of integration is O. 

Calling aa the sequence of functions ctn(qi, . . . , q n ) = if n ^ 1 and an(q) = zxn(q), 
a recursive expansion arises, namely 

pn = za n + z 2 Ka n + z 3 K 2 a n + z 4 K 3 a n + ■■■ (14.3) 

It gives the correlation functions, provided the series converges. The remark 

\KVan(qi,...,q n )\ < e W B+1 »{ J \e~^ - l\dq) P = f e W B+1 » r{(3)^ (14.4) 
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shows that the series (14.3), called Mayer's series converges if \z\ < e~( 2/3B+1 V(/3)~ 3 . 
Convergence is uniform (as O — > oo) and (X p )o;n(qi, . . . , q n ) tends to a limit as V — > oo 
at fixed qi,...,q n and the limit is simply (K p a)(qi, . . . , q n ), if a(qi,...,q n ) = for 
n^l, and ct(qi) = 1. This is because the kernel K contains the factors (e - ^' 11-5 ') — 1) 
which decay rapidly or will even be eventually 0, if (p has finite range. It is also clear that 
(K p a)(qi, . . . , q n ) is translation invariant. 

Hence if \z\e 2 ^ B+1 r((3) 3 < 1 the limits as O — > oo of the correlation functions exist 
and can be computed by a convergent power series in z; the correlation functions will be 
translation invariant (in the thermodynamic limit). 

In particular the one-point correlation function p = p(q) is p = z (l + 0(zr((3) 3 )), 
which to lowest order in z just shows that activity and density essentially coincide when 
they are small enough. Furthermore (3pn = y log Z gc {(3, A, V) is such that zd z (3po, = 
y J pn(q)dq (from the definition of in (14.1)). Therefore 

i r z dz' 

pp((3, z) = lim - log Z° c ([3, A, V) = / —p(p, z') (14.5) 

showing that the density p is analytic in z as well and p ~ z for 2; small: therefore 
the grand canonical pressure is analytic in the density and j3p = p(l + 0(p 2 )), at small 
density. In other words the equation of state is, to lowest order, essentially the equation of 
a perfect gas. All quantities that are conceivably of some interest turn out to be analytic 
functions of temperature and density. The system is essentially a free gas and it has no 
phase transitions in the sense of a discontinuity or of a singularity in the dependence of a 
thermodynamic function in terms of others. Furthermore the system cannot show phase 
transitions in the sense of sensitive dependence on boundary conditions of fixed external 
particles: this also follows, with some extra work, from the Kirkwood-Salzburg equations. 

Bibliography: [Ru69], [Ga99]. 

15. Lattice models 

The problem of proving existence of phase transitions in models of homogeneous gases 
with pair interactions is still open. Therefore it makes sense to study phase transitions 
problem in simpler models, tractable to some extent but nontrivial and which even have 
an interest in their own from the point of view of applications. 

The simplest models are the so-called lattice models in which particles are constrained 
to points of a lattice: they cannot move in the ordinary sense of the word (but of course 
they could jump) and therefore their configurations do not contain momentum variables. 

Interaction energy is just potential energy and ensembles are defined as collections of 
probability distributions on the position coordinates of the particle configurations. Usually 
the potential is a pair potential decaying fast at 00 and, often, with a hard core forbidding 
double or higher occupancy of the same lattice site. For instance the lattice gas with 
potential cp, in a cubic box O with |0| = V = L d sites of a square lattice with mesh a > 0, 
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is defined by the potential energy attributed to the configuration X of occupied distinct 
sites, i.e. subsets IcO: 

H(X) = - J2 Vix-y). (15.1) 

(x,y)€X 

where the sum is over pairs of distinct points in X. The canonical ensemble and the grand 
canonical ensemble are the collections of distributions, parameterized by (/?, p) (p = y) 
or, respectively, by (/?, A), attributing to X the probability: 



e -(3H(X) e f3\\X\ e -(3H(X) 

P ^ {X) = ZW,N,tl) 5 W> N > ° r mx(X) = WiPAM) ' (15 ' 2) 

where the denominators are normalization factors that can be called, in analogy with 
the theory of continuous systems, canonical and grand canonical partition functions; the 
subscript p stands for particles. 

A lattice gas in which in each site there can be at most one particle can be regarded 
as a model for the distribution of a family of spins on a lattice. Such models are quite 
common and useful: for instance they arise in studying systems with magnetic properties. 
Simply identify an "occupied" site with a "spin up" or + and an "empty" site with a "spin 
down" or — (say). If cr = {a x } xe n is a spin configuration, the energy of the configuration 
"for potential (p and magnetic field h v will be 

H ( (T ) = - ^2 i p( x -y)^x^y-h s ^a x (15.3) 

(x,y)eQ x 

with the sum running over pairs (x, y) G O of distinct sites. If <p(x — y) = J xy > the 
model is called a ferromagnetic Ising model. As in the case of continuous systems it will 
assumed finite range for (p: i.e. (p(x) = for \x\ > R, for some R, unless explicitly stated 
otherwise. 

The canonical and grand canonical ensembles in the box O with respective parameters 
(/?, m) or (P, h) will be defined as the probability distributions on the spin configurations 
f = {o~x}xen with J2 xE q o~ x = M = mV or without constraint on M, respectively; hence 

= Zg(/3,M,0) ' ^ (<T) = Wi/WT) (15 - 4) 

where the denominators are normalization factors again called the canonical and grand 
canonical partition functions. As in the study of the previous continuous systems canonical 
and grand canonical ensembles with "external fixed particle configurations" can be defined 
together with the corresponding ensembles with "external fixed spin configurations"; the 
subscript s stands for spins. 

For each configuration X C O of a lattice gas let {n x } be n x = 1 if x G X and 
n x = if x £ X. Then the transformation a x = 2n x — 1 establishes a correspondence 
between lattice gas and spin distributions. In the correspondence the potential <p(x — y) of 
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the lattice gas generates a potential \<p(x — y) for the corresponding spin system and the 
chemical potential A for the lattice gas is associated with a magnetic field h for the spin 
system with h = i(A + Y. x ^o L f( x ))- 

The correspondence between boundary conditions is natural: for instance a boundary 
condition for the lattice gas in which all external sites are occupied becomes a boundary 
condition in which external sites contain a spin +. The relation between lattice gas and 
spin systems is so close to permit switching from one to the other with little discussion. 

In the case of spin systems empty boundary conditions are often considered (no spins 
outside Q). In lattice gases and spin systems (as well as in continuum systems) often 
periodic and semiperiodic boundary conditions are considered (i.e. periodic in one or more 
directions and with empty or fixed external particles or spins in the others). 

Thermodynamic limits for the partition functions 



-pf(p,v) = lim llogZ p c (/3,iV,0), Pp(P,X)= lim ^ logZf (p, A, Q) 



v__ 

N ' 



-Pg(p,m)= lim 1 log Z s c (/3, M, O), p f(p, h) = lim 1 log Zf (p, A, Q) 



(15.5) 



can be shown to exist by a method similar to the one discussed in Appendix A2. They 
have convexity and continuity properties as in the cases of the continuum systems. In the 
lattice gas case the f,p functions are still interpreted as free energy and pressure. In the 
spin case /(/?, h) has the interpretation of magnetic free energy while g(P, m) does not have 
a special name in the thermodynamics of magnetic systems. As in the continuum systems 
it is occasionally useful to define infinite volume equilibrium states: 

Definition: : An infinite volume state with parameters (P,h), or (P,m) is a collec- 
tion of average values F — > (F) obtained, respectively, as limits of finite volume averages 
{F) n defined from canonical, or grand canonical distributions in fi n with fixed parameters 
(P,h) or (P,m), or (u,v) and with general boundary condition of fixed external spins or 
empty sites, on sequences fi n — > oo for which such limits exist simultaneously for all local 
observables F . 

This is taken verbatim from the definition in Sect. 9. In this way it makes sense to 
define the spin correlation functions for X = (£i,-..,£ n ) as if o~x = IX/ "^-- For 

def 

instance we shall call p(^i,^2) = ( cr ^i°"^ 2 ) an d a pure phase can be defined as an infinite 
volume state such that 

(axO-y+s) - {<T X ){<TY+t) j=^> ® (15-6) 
Bibliography: [Ru69], [Ga99]. 

16. Thermodynamic limits and inequalities 

An interesting property of lattice systems is that it is possible to study delicate ques- 
tions like existence of infinite volume states in some (moderate) generality. A typical 
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tool is the use of inequalities. As the simplest example of a vast class of inequalities 
consider the ferromagnetic Ising model with some finite (but arbitrary) range interaction 
J X y > in a field h x > 0: J, h may even be not translationally invariant. Then the 

def 

average of ax = a Xl a X2 . . . a Xn , X = (x\, . . . , x n ), in a state with "empty boundary con- 
ditions" (i.e. no external spins) satisfies the inequalities (crx), dh x (ax), dj xy (°~x) > 
X = (x\, . . . , x n ). More generally let H(cr) in (15.3) be replaced by H(cr) = — J2 X Jx^x 
with Jx > and X can be any finite set; then, if Y = (y 1? . . . , y n ), X = (x\, . . . , x n ), the 
following Griffiths inequalities hold 

(o-x)>0, dj Y (a x ) = (cjxo-y) - ((Jx)M > (16.1) 

The inequalities can be used to check, in ferromagnetic Ising models, (15.3), existence 
of infinite volume states (c.f. Sect. 9, 15) obtained by fixing the boundary condition B to 
be either "all external spins +" or "all external sites empty" . If (F) B n denotes the grand 
canonical average with boundary condition B and any fixed ft, h > 0, this means that for 
all local observables F(<T\), i.e. for all F depending on the spin configuration in any fixed 
region A, all the following limits exist 

Jim (F) B>n = (F) B (16.2) 

The reason is that the inequalities (16.1) imply that all averages (crx)g q are monotonic 
in O for all fixed IcO: so the limit (16.2) exists for F{&) = o~x- Hence it exists for all 
F depending only on finitely many spins, because any local function F "measurable in A" 
can be expressed (uniquely) as a linear combination of functions ax with X C A. 

Monotonicity with empty boundary conditions is seen by considering the sites outside 
O and in a region O' with side 1 unit larger than that of O and imagining that the couplings 
Jx with led' but X <f_ O vanish. Then (ctx)qi > (°~x)q because (ax)n> is an average 
computed with a distribution corresponding to an energy with the couplings Jx with 
X <£ O but X C fl' changed from to J x > 0. 

Likewise if the boundary condition is + then enlarging the box from O to O' cor- 
responds to decreasing an external field acting h on the external spins from +oo (which 
would force all external spins to be +) to a finite value h > 0: so that increasing the box O 
causes (ax) + q to decrease. Therefore as O increases Ising ferromagnets spin correlations 
increase if the boundary condition is empty and decrease if it is +. 

The inequalities can be used in similar ways to prove that the infinite volume states 
obtained from + or empty boundary conditions are translation invariant; and that in 
zero external field, h = 0, the + and — boundary conditions generate pure states if the 
interaction potential is only a pair ferromagnetic interaction. 

There are many other important inequalities which can be used to prove several ex- 
istence theorems along very simple paths. Unfortunately their use is mostly restricted to 
lattice systems and requires very special assumptions on the energy (e.g. ferromagnetic 
interactions in the above example). The quoted examples were among the first discovered 
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and provide a way to exhibit nontrivial thermodynamic limits and pure states. 



Bibliography: [Ru69], [Le74], [Ga99], [LL01], [Li02]. 

17. Symmetry breaking phase transitions 

The simplest phase transitions, see Sect. 9, are symmetry breaking transitions in 
lattice systems: they take place when the energy of the system in a container Q and with 
some special boundary condition (e.g. periodic, or antiperiodic or empty) is invariant with 
respect to the action of a group Q on phase space. This means that on the points x of phase 
space acts a group of transformations Q so that with each 7 e Q a map x — > £7 is associated 
which transforms x into x"f respecting the composition law in Q, i.e. (^7)7' = #(77'). If F 
is an observable the action of the group on phase space induces an action on the observable 

F changing F(x) into F 1 (x) d = F(x / y~ 1 ). 

A symmetry breaking transition occurs when by fixing suitable boundary conditions 
and taking the thermodynamic limit a state F — > (F) is obtained in which some local 
observable shows a non symmetric average (F) 7^ (F 7 ) for some 7. 

An example is provided by the "nearest neighbor ferromagnetic Ising model" on a d- 
dimensional lattice with energy function given by (15.2) with <p(x—y) = unless \x—y\ = 1, 
i.e. unless x, y are nearest neighbors, in which case tp(x — y) = J > 0. With periodic or 
empty boundary conditions it exhibits a discrete "up-down" symmetry a — > — cr. 

Instability with respect to boundary conditions can be revealed by considering the 
two boundary conditions, denoted + or — , in which the lattice sites outside the container 
Q are either occupied by spins + or by spins — . Consider also, for later reference, the 
boundary conditions in which the boundary spins in the upper half of the boundary are 
+ and the ones in the lower part are — : call this the ±-boundary condition; see Fig. 2. Or 
the boundary conditions in which some of the opposite sides of fi are identified while + or 
— conditions are assigned on the remaining sides: call these "cylindrical or semiperiodic 
boundary conditions". 

A new description of the spin configurations is useful: given cr draw a unit segment 
perpendicular to the center of each bond b having opposite spins at its extremes. An 
example of this construction is provided by Fig. 2 for the boundary condition ±. 



L" 



T 



+ 1 + + + + + + I - - - I + + + 1 + 



O 



□ 



LJ-G 



I— B 



L:_^Ek_:_^_:_^Z71_l_L 



Fig. 2: The dashed line is the boundary of fi; the outer spins correspond to the ± boundary condition. 
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The points A, B are points where an open "line" A ends. 



The set of segments can be grouped into lines separating regions where the spins are 
positive from regions where they are negative. If the boundary condition is + or — the 
lines form "closed polygons" while if the condition is ± there is also a single polygon Ai 
which is not closed (as in Fig. 2). If the boundary condition is periodic or cylindrical all 
polygons are closed but some may "go around" O. 

The correspondence (71, 72, . . . , 7 n , — >er, for the boundary condition ± or, for 
the boundary condition + (or —), cr< — ► (7i> • • • > 7n) * s one-to-one and, if h = 0, the en- 
ergy Hn(cr) of a configuration is higher than — J • (number of bonds in O) by an amount 
2J(|Ai| + E« l7il) or ? respectively, 2J^ |7j|. The grand canonical probability of each spin 
configuration is therefore proportional, if h = 0, respectively to 

e -2/3J(|A 1 |+^J 7l |) Qr e -2/3j£j7,| ( 17J ) 

and the "up-down" symmetry is clearly reflected by (17.1). 

The average (o~ x )q , of a + with + boundary conditions is given by (cr x )n + = 1 — 
2 Pn,+ (— ), where Pq i+ (— ) is the probability that the spin a x is —1. If the site x is occupied 
by a negative spin then the point x is inside some contour 7 associated with the spin 
configuration cr under consideration. Hence if ^(7) is the probability that a given contour 
belongs to the set of contours describing a configuration cr, it is Pa,+ {— ) < E 7 ox P(l) 
where 70X means that 7 "surrounds" x. 

If T = (71, . . . , 7 n ) is a spin configuration and if the symbol V compj means that the 
contour 7 is "disjoint" from 71, . . . , 7 n (^.e. {7 U V} is a new spin configuration), then 

p U) = 6^2 = e -2/3J|7l campy < 2/3J| 7 | (1? _ 2) 

because the last ratio in (17.2) does not exceed 1. Remark that there are at most 3 P 
different shapes of 7 with perimeter p and at most p 2 congruent 7's containing x; therefore 
the probability that the spin at x is — when the boundary condition is + satisfies the 
inequality P n ,+(-) < Y^=^ v e~^ Jv 73^0. 

This probability can be made as small as wished so that (<J x )q + is estimated by a 
quantity which is as close to 1 as desired provided (3 is large enough and the closeness of 
(°~x)q + to 1 is both x and O independent. 

A similar argument for the (— )-boundary condition, or the remark that for h = 
it is (o~ x )q_ = —(o~ x )q + , leads to conclude that, at large /3, (a x ) n _ 7^ (o~ x )q + and 
the difference between the two quantities is positive uniformly in O. This is the proof 
(Peierls' theorem) of the fact that there is, if (3 is large, a strong instability with respect 
to the boundary conditions of the magnetization: i.e. the nearest neighbor Ising model in 
dimension 2 (or greater, by an identical argument) has a phase transition. If the dimension 
is 1 the argument clearly fails and no phase transition occurs, see Sect. 12. 



Bibliography: [Ga99]. 
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18. Finite volume effects 



The description in Sect. 17 of the phase transition in the nearest neighbor Ising model 
can be made more precise from the physical point of view as well as from the mathematical 
point of view giving insights into the nature of the phase transitions. Assume that the 
boundary condition is the (+) -boundary condition and describe a spin configuration a by 
means of the associated closed disjoint polygons (71, ... , 7 n ). Attribute to cr = (7^ ... , 7 n ) 
a probability proportional to (17.1). Then the following Minlos-Sinai's theorem holds 

// (3 is large enough there exist C > 0, ^(7) > with ^(7) < e -2/3J ' 7 ' and such that a spin 
configuration cr randomly chosen out of the grand canonical distribution with + boundary 
conditions and h = will contain, with probability approaching 1 as O — > 00, a number 
K^(cr) of contours congruent to 7 such that 



\K {l) (cr) - p( 7 ) |0|| < C^nje"^ 171 



(18.1) 



and this relation holds simultaneously for all 7 's. 

Thus there are very few contours (and the larger they are the smaller is, in absolute 
and relative value, their number): a typical spin configuration in the grand canonical 
ensemble with (+) -boundary conditions is such that the large majority of the spins is 
"positive" and, in the "sea" of positive spins, there are a few negative spins distributed in 
small and rare regions (in a number, however, still of order of |0|). 

Another consequence of the analysis in Sect. 17 concerns the the approximate equation 
of state near the phase transition region at low temperatures and finite O. 

mn(/3, h) 

0(|A|- 1/2 ) 



J 


^mn(P, h) 


-o(l«r 1/2 ) 


/ o(N~ 1/2 ) 


•- 


► 

t -m*(j3) 







m*(f3) 

-0(|A|" 1/2 ) 



h 



(3) 



h 



-m 



If Q is finite the graph of h — > m^(/3, h) will have a rather different behavior depending 
on the possible boundary conditions; e.g. if the boundary condition is (+) or (— ) one gets 
respectively, the results depicted in Fig. 3, where m*(f3) denotes the spontaneous magneti- 

def 

zation, i.e. m*(/3) = lim /l ^ + limj^oo mn(/3, h). 

With periodic or empty boundary conditions the diagram changes as in the first of 
Fig. 4: the thermodynamic limit m((3,h) = limn^oo mn(/?, h) exists for all h 7^ and 
the resulting graph is in the second of Fig. 4, which shows that at h = the limit is 
discontinuous. It can be proved, if (3 is large enough, that 00 > lim^ + 9/ l m(/3, h) = 
x(P) > (i.e. the angle between the vertical part of the graph and the rest is sharp). 

mn(/3, h) 

1--- 
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m 



m(f3, h) 



(4) 



h 



Furthermore it can be proved that m((3, h) is analytic in h for h 7^ 0. If (3 is small enough 
analyticity holds at all h. For (3 large the function f((3,h) has an essential singularity 
at h = 0: a result that can be interpreted as excluding a naive theory of metastability 
as a description of states governed by an equation of state obtained from an analytic 
continuation to negative values of h of /(/3, h). 

The above considerations and results further clarify what a phase transition for a finite 
system means. 

Bibliography: [Ga99], [FP04]. 

19. Beyond low temperature (ferromagnetic Ising model) 

A limitation of the results discussed above is the condition of low temperature 
large enough"). A natural problem is to go beyond the low-temperature region and to 
describe fully the phenomena in the region where boundary condition instability takes place 
and first develops. A number of interesting partial results are known, which considerably 
improve the picture emerging from the previous analysis. A striking list, but far from 
exhaustive, of such results follows and focuses on properties of ferromagnetic Ising spin 
systems. The reason for restricting to such cases is that they are simple enough to allow a 
rather fine analysis which sheds considerable light on the structure of statistical mechanics 
suggesting precise formulation of the problems that it would be desirable to understand in 
more general systems. 

(1) Let z d = e^ h and consider the the product of z v (V is the number of sites of O) times 
the partition function with periodic or perfect-wall boundary conditions with finite range 
feromagnetic interaction, not neceesarily nearest neighbor; a polynomial in z (of degree 
2V) is thus obtained. Its zeros lie on the unit circle \z\ = 1: this is Lee- Yang's theorem. It 
implies that the only singularities of /(/?, h) in the region < (3 < 00, —00 < h < +00 can 
be found at h = 0. 

A singularity can appear only if the point z = 1 is an accumulation point of the 
limiting distribution (as O — > 00) of the zeros on the unit circle: if the zeros are zi, ... , Z2V 



then y log z v Z((3, h, O, periodic) = 2(3 J + (3h + y J2i=i ^°E,( Z ~ z i) an d if V 1 • (number 
of zeros of the form z 3 = e id i with 9 < 63 < 9 + d9) it is 



The existence of the measure dpp(9) follows from the existence of the thermodynamic limit: 
but dpp{9) is not necessarily <i6*-continuous, i.e. proportional to d9. 

(2) It can be shown that, with a not necessarily nearest neighbor interaction, the zeros of 
the partition function do not move too much under small perturbations of the potential 
even if one perturbs the energy (at perfect-wall or periodic boundary conditions) into 




(19.1) 



H' Q (a) = H n (a) + (SH n )(<r), (SH n )(a) = £ J'(X) a x 



(19.2) 
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where J'(X) is very general and denned on subsets X = (xi, . . . ,Xk) C O such that the 
quantity || J'\\ = sup yeZ d ^2 ye x l^'(-^OI i s small enough. More precisely fixed a ferromag- 
netic pair potential J suppose that one knows that, when J' = 0, the partition function 
zeros in the variable z = e^ h lie in a certain closed set N (of the unit circle) in the z- 
plane. Then if J' ^ they lie in a closed set N 1 , O-independent and contained in a 
neighborhood of N of width shrinking to when || J'\\ — > 0. This leads to establish various 
relations between analyticity properties and boundary condition instability as described 
in (3) below. 

(3) In the ferromagnetic Ising model, with a not necessarily nearest neighbor interaction, 
one says that there is a gap around if dpp(6) = near 6 = 0. It can be shown that if (3 
is small enough there is a gap for all h of width uniform in h. 

(4) Another question is whether the boundary condition instability is always revealed by 
the one-spin correlation function (i.e. by the magnetization) or whether it might be shown 
only by some correlation functions of higher order. It can be proved that no boundary 
condition instability takes place for h ^ 0; at h = it is possible only if 

lim m(/3,h)^ lim m((3, h) (19.3) 
h^o- h^o+ 

(5) A consequence of the Griffiths' inequalities, c.f. Sect. 16, is that if (19.3) is true for a 
given Pq then it is true for all (3 > (3q. Therefore item (4) leads to a natural definition 
of the critical temperature T c as the least upper bound of the T's such that (19.3) holds 
(k B T = (3~ 1 ). 

(6) If d = 2 the free energy of the nearest neighbor ferromagnetic Ising model has a 
singularity at (3 C and the value of (3 C is known exactly from the exact solutions of the 

model: m(/3, 0) d = ' m*(j3) = (1 — sinh 4 2j3J)^ . The location and nature of the singularities 
of /(/3, 0) as a function of (3 remains an open question for d = 3. In particular the question 
of whether there is a singularity of /(/?, 0) at (3 = f3 c is open. 

(7) For f3 < (3 C there is instability with respect to boundary conditions (see (6) above) 
and a natural question is: how many "pure" phases can exist in the ferromagnetic Ising 
model? (c.f. (9.2) and Sect. 9). Intuition suggests that there should be only two phases: 
the positively magnetized and the negatively magnetized ones. 

One has to distinguish between translation invariant pure phases and non translation 
invariant ones. It can be proved that, in the case of the 2-dimensional nearest neigh- 
bor ferromagnetic Ising models, all infinite volume states (c.f. Sect. 15) are translationally 
invariant. Furthermore they can be obtained by considering just the two boundary condi- 
tions + and — : the latter states are pure states also for models with non nearest neighbor 
ferromagnetic interaction. The solution of this problem has led to the introduction of many 
new ideas and techniques in statistical mechanics and probability theory. 

(8) In any dimension d > 2, for (3 large enough it can be proved that the nearest neigbor 
Ising model has only two translation invariant phases. If the dimension is > 3 and (3 is 
large the + and — phases exhaust the set of translationally invariant pure phases but there 
exist non translationally invariant phases. For (3 close to (3 C , however, the question is much 
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more difficult. 



Bibliography: [On44], [LY52], [Ru71], [Si91], [Ga99], [Ai80], [Hi81], [FP04] 
20. Geometry of phase coexistence. 

Intuition about the phenomena connected with the classical phase transitions is usu- 
ally based on the properties of the liquid-gas phase transition; this transition is experimen- 
tally investigated in situations in which the total number of particles is fixed (canonical 
ensemble) and in presence of an external field (gravity). 

The importance of such experimental conditions is obvious; the external field produces 
a nontranslationally invariant situation and the corresponding separation of the two phases. 
The fact that the number of particles is fixed determines, on the other hand, the fraction 
of volume occupied by each of the two phases. 

Once more consider the nearest neighbor ferromagnetic Ising model: the results avail- 
able for it can be used to obtain a clear picture of the solution to problems that one would 
like to solve but which in most other models are intractable with present day techniques. 

It will be convenient to discuss phase coexistence in the canonical ensemble distribu- 
tions on configurations of fixed total magnetization M = mV, see Sect. 15, (15.4). Let f3 
be large enough to be in the two phases region and, for a fixed a G (0, 1), let 

m = am*(j3) + (l-a)(-m'(j3))= (l-2a)m*(/3) (20.1) 

i.e. m is in the vertical part of the diagram m = m((3, h) at (3 fixed (see Fig. 4). 

Fixing m as in (20.1) does not yet determine the separation of the phases in two 
different regions; for this effect it will be necessary to introduce some external cause favoring 
the occupation of a part of the volume by a single phase. Such an asymmetry can be 
obtained in at least two ways: through a weak uniform external field (in complete analogy 
with the gravitational field in the liquid-vapor transition) or through an asymmetric field 
acting only on boundary spins. This second way should have the same qualitative effect 
as the former, because in a phase transition region a boundary perturbation produces 
volume effects: see Sect. 9,17. From a mathematical point of view it is simpler to use a 
boundary asymmetry to produce phase separations and the simplest geometry is obtained 
by considering ±- cylindrical or ++- cylindrical boundary conditions: this means ++ or 
± boundary conditions periodic in one direction (e.g. in Fig. 2 imagine the right and left 
boundary identified after removing the boundary spins on them). 

Spins adjacent to the bases of O act as symmetry-breaking external fields. The ++- 
cylindrical boundary condition should favor the formation inside O of the positively mag- 
netized phase; therefore it will be natural to consider, in the canonical distribution, this 
boundary condition only when the total magnetization is fixed to be the spontaneous 
magnetization m*((3). 

On the other hand, the ±-boundary condition favors the separation of phases (pos- 
itively magnetized phase near the top of O and negatively magnetized phase near the 
bottom) . Therefore it will be natural to consider the latter boundary condition in the case 
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of a canonical distribution with magnetization m = (1 — 2a) m*((3) with < a < 1, (20.1). 
In the latter case the positive phase can be expected to adhere to the top of Q and to 
extend, in some sense to be discussed, up to a distance O(L) from it; and then to change 
into the negatively magnetized pure phase. 

To make precise the phenomenological description consider the spin configurations cr 
through the associated sets of disjoint polygons (c.f. Sect. 17). Fix the boundary conditions 
to be ++ or ±-cylindrical boundary conditions and note that polygons associated with a 
spin configuration cr are all closed and of two types: the ones of the first type, denoted 
71, ... , 7 n , are polygons which do not encircle Q, the second type of polygons, denoted by 
the symbols A a , are the ones which wind up, at least once, around O. 

So a spin configuration cr will be described by a set of polygons the statistical weight 
of a configuration cr = (71, . . . , j n , X 1 , . . . , X h ) is (c.f. (17.1)): 

e-WEj^l+E^-l). (20.2) 

The reason why the contours A that go around the cylinder Q are denoted by A (rather 
than by 7) is that they "look like" open contours, see Sect. 17, if one forgets that the 
opposite sides of O have to be identified. In the case of the ±-boundary conditions then 
the number of polygons of A-type must be odd (hence 7^ 0), while for the ++-boundary 
condition the number of A-type polygons must be even (hence it could be 0). 

Bibliography: [Si91], [Ga99] 

21. Separation and Coexistence of Phases 



In the context of the geometric description of the spin configuration in Sect. 20 consider 
the canonincal distributions with ++- cylindrical or the ±-cylindrical boundary conditions 
and zero field: they will be denoted briefly as /z / g j ++, A*/3,±- The following theorem Minlos- 
Sinai's theorem provided the foundations of the microscopic theory of coexistence: it is 
formulated in dimension d = 2 but, modulo obvious changes, it holds for d > 2. 

For < a < 1 fixed let m = (1 — 2a) m*((3); then for (3 large enough a spin configura- 
tion cr = (71, ... , 7 n , Ai, . . . , A2/1+1) randomly chosen with the distribution £t/3,± enjoys the 
properties (l)-(4) below with a \ip ^-probability approaching 1 as O — > 00; 

(1) cr contains only one contour of X-type and 

I |A| - (1 +e((3))L\ < o(L) (21.1) 

where s(j3) > is a suitable (a-independent) function of (3 tending to zero exponentially 
fast as f3 — > 00. 

(2) 7/0^,0^" denote the regions above and below X and = V, are the volumes 
of O, O - it is 

I \n+\-aV\ < k((3) V 3/4 -(l-a)V\ < k((3)V 3/4 (21.2) 
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where k((3) p^^ exponentially fast; the exponent |, here and below, is not optimal. 

(3) IfM x = f^ enx a x , it is 

\M\ — am* {P)V\< k([3)V 3/ \ M' x -(l-a)m*((3)V\ < k([3)V 3/4 (21.3) 

(4) If K^(cr) denotes the number of contours congruent to a given 7 and lying in then, 
simultaneously for all the shapes 0/7: 

I K*(&) - pin) aV\< Ce- pJ ^V^ C>0 (21.4) 

where p{n) < e~ 2 ^ J ^ is the same quantity already mentioned in the text of the theorem of 
§(6.8). A similar result holds for the contours below A (c.f. the comments on (18.1)). 

The theorem not only provides a detailed and rather satisfactory description of the 
phase separation phenomenon, but it also furnishes a precise microscopic definition of the 
line of separation between the two phases, which should be naturally identified with the 
(random) line A. 

A similar result holds in the canonical distribution fj,p j++jm *^ where (1) is replaced 
by: no A-type polygon is present, while (2), (3) become superfluous and (4) is modified in 
the obvious way. In other words a typical configuration for the distribution the fip :++jm *(p^ 
has the same appearance as a typical configuration of the corresponding grand canonical 
ensemble with (+)-boundary condition (whose properties are described by the theorem of 
Sect.19). 

Bibliography: [Si91], [Ga99]. 

22. Phase separation line and surface tension 

Continuing to refer to the nearest neighbor Ising ferromagnet, the theorem of Sect. 21 
means that, if (3 is large enough, then the microscopic line A, separating the two phases, 
is almost straight (since e{(3) is small). The deviations of A from a straight line are more 
conveniently studied in the grand canonical distributions p± with boundary condition set 
to +1 in the upper half of dfl, vertical sites included, and to —1 in the lower half: this 
is illustrated in Fig. 2, Sect. 17. The results can be converted into very similar results for 
grand canonical distributions with i-cylindrical boundary conditions of Sect. 21, 22. 

Define A to be rigid if the probability that A passes through the center of the box O 
(i.e. 0) does not tend to as O — > 00; otherwise it is not rigid. 

The notion of rigidity distinguishes between the possibilities for the line A to be 
"straight": the "excess" length e((3)L, see (21.1), can be obtained in two ways: either the 
line A is essentially straight (in the geometric sense) with a few "bumps" distributed with 
a density of order s(j3) or, otherwise, it is only locally straight and with an important part 
of the excess length being gained through a small bending on a large length scale. In three 
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dimensions a similar phenomenon is possible. Rigidity of A, or its failure, can in principle 
be investigated by optical means; there can be interference of coherent light scattered by 
macroscopically separated surface elements of A only if A is rigid in the above sense. 

It has been rigorously proved that, the line A is not rigid in dimension 2. And, at least 
at low temperature, the fluctuation of the middle point is of the order O(VZ). In dimension 
3 however it has been shown that the surface A is rigid at low enough temperature. 

A deeper analysis is needed to study the shape of the separation surface under oth- 
erconditions, e.g. with + boundary conditions and in a canonical distribution with mag- 
netization intermediate between ±m*((3). It involves, as a prerequisite, the definition and 
many properties of the surface tension between the two phases. Here only the defini- 
tion of surface tension in the case of ±-boundary conditions in the 2-dimensional case 
will be mentioned. If Z++ (tt, m* (/3)) and Z + -(tl,m) are the canonical partition func- 
tions for the ++ and ±-cylindrical boundary conditions the tension r(f3) is defined as 

(3t{(3) = — limn^oo \ log z++{^rn*{i3)) • limit can be shown to be a-independent for 
(3 large enough: the definition motivation and justification is based on the microscopic 
geometric description in Sect. 20. The definition can be naturally extended to higher di- 
mension (and to more general non nearest neighbor models). If d = 2 the tension r can be 
exactly computed at all temperature below criticality and is f3r((3) — +2/3J + logtanh/3J. 

More remarkably the definition can be extended to define the surface tension r(/3, n) 
in the "direction n", i.e. when the boundary conditions are such that the line of separation 
is in the average orthogonal to the unit vector n. In this way if d = 2 and a G (0, 1) is 
fixed it can be proved that at low enough temperature the canonical distribution with 
+ boundary conditions and intermediate magnetization m = (1 — 2a)m*((3) has typical 
configurations containing a region of spins — of area ~ aV; furthermore if the container is 
rescaled to size L = 1 the region will have a limiting shape filling an area a bounded by a 
smooth curve whose form is determined by the classical macroscopic Wulff 's theory of the 
shape of crystals in terms of the surface tension r(n). 

An interesting question remains open in the three-dimensional case: it is conceivable 
that the surface, although rigid at low temperature, might become "loose" at a temper- 
ature T c smaller than the critical temperature T c (the latter being defined as the high- 
est temperature below which there are at least two pure phases). The temperature T c , 
whose existence is rather well established in numerical experiments, would be called the 
"roughening transition" temperature. The rigidity of A is connected with the existence of 
translationally noninvariant equilibrium states. The latter exist in dimension d = 3, but 
not in dimension d = 2 where the discussed nonrigidity of A, established all the way to T c , 
provides the intuitive reason for the absence of non translationally invariant states. It has 
been shown that in d = 3 the roughening temperature T c (/3) necessarily cannot be smaller 
than the critical temperature of the 2-dimensional Ising model with the same coupling. 

Note that existence of translationally noninvariant equilibrium states is not necessary 
for the description of coexistence phenomena. The theory of the nearest neighbor two- 
dimensional Ising model is a clear proof of this statement. 

Bibliography: [On44], [Be75], [Si91], [Mi95], [PV99], [Ga99], 
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23. Critical points 



Correlation functions for a system with short range interactions and in an equilibrium 
state which is a pure phase have cluster properties, see (9.2): their physical meaning is that 
in a pure phase there is independence between fluctuations occurring in widely separated 
regions. The simplest cluster property concerns the "pair correlation function", i.e. the 
probability density p(qi,q2) of finding particles in points qi,q2 independently of where 
the other particles may happen to be, see (9.3). In the case of spin systems the pair 
correlation p(qi,q2) = (c"qiCq 2 ) wu l be considered. The pair correlation of a translation 
invariant equilibrium state has a cluster property, (9. 2), (15. 6), if 

Ip(qi,q 2 )-P 2 i , qi _ q2Hoo = (23-1) 

where p is the probability density for finding a particle at q (i.e. the physical density of 
the state) or p = (cr q ) is the average of the value of the spin at q (i.e. the magnetization 
of the state). 

A general definition of critical point is a point c in the space of the parameters char- 
acterizing equilibrium states, e.g. /?, A in grand canonical distributions or f3,v in canonical 
distributions or /?, h in the case of lattice spin systems in a grand canonical distribution. 
In systems with short range interaction (i.e. with (f(r) vanishing for |r| large enough) 
the point c is a critical point if the pair correlation tends to 0, see (23.1), slower than 
exponential, for instance as a power of the distance |r| = |qi — q 2 |. 

A typical example is the 2-dimensional Ising model on a square lattice and with nearest 
neighbor ferromagnetic interaction of size J. It has a single critical point at (3 = /3 C , h = 
with sinh2/? c J = 1. The cluster property is that (c x a y ) — (o- x ){a y ) , _ , >0 as 

-n(/3)\x-y\ -K((3)\x-y\ i 

A+((3) -, A-((3)— A c - -r (3 = (3 C (23.2) 

V\ X ~V\ \x-y? \x-y\t 

for (3 < (3 C , (3 > (3 C or (3 = (3 C respectively, where A±((3), A c , k((3) > 0. The properties 
(23.2) stem from the exact solution of the model. 

At the critical point several interesting phenomena occurr: the lack of exponential 
decay indicates lack of a length scale over which really distinct phenomena can take place 
and properties of the system observed at different length scales are likely to be simply 
related by suitable scaling transformations. Many efforts have been dedicated at finding 
ways of understanding quantitatively the scaling properties which relate different observ- 
ables. The result has been the development of the renormalization group approach to 
critical phenomena, c.f. Sect. 25. The picture that emerges is that the closer the critical 
point is the larger becomes the maximal scale of length below which scaling properties are 
observed. For instance in a lattice spin system in field the magnetization M|A| _a in a 
box AcO should have essentially the same distribution for all A's with side < Iq((3) and 
lo((3) oo as (3 — > (3 C , provided a is suitably chosen. The number a is called a critical 
exponent. 



2//ebbraio/2008; 14:37 



38 



There are several other "critical exponents" that can be defined near a critical point. 
They can be associated with singularities of the thermodynamic function or with the 
behavior of the correlation functions involving joint densities at two or more than two 
points. As an example consider a lattice spin system: then the "2n-spins correlation" 
(cr cr ?1 . • •cr^ 2n _ 1 ) c could behave proportionally to X2n(0,£i> • • • j&n-i), n = 1,2,3, .. .„ for 
a suitable family of homogeneous functions Xn, of some degree W2n, of the coordinates 
(£1, • • • i £2n-i) at least when the reciprocal distances are large but < Iq((3) and lo(/3) = 
const ((3 — I3 C )~ U p^p^ °°- This means that if & are regarded as points in K there are 
functions %2n such that: 

X2n (0, |, . . . , ^i) = A— X 2n(0, < A e R (23.3) 

and (cr cr^ . . . a^ n _ 1 ) c oc X2n(0, £i, . . . , 6>n-i) if 1 < l^i < lo(fi). The numbers w 2 n 

define a sequence of critical exponents. 

Other critical exponents can be associated with approaching the critical point along 
other directions: for instance along h — > at f3 = f3 c . In this case the length up to which 
there are scaling phenomena is lo(h) = £ D h~ v . And the magnetization m(h) tends to as 
h — > at fixed j3 = f3 c as m(h) = mohs for S > 0. 

None of the critical exponents is in any generality known rigorously, including its 
existence. An exception is the case of the 2-dimensional nearest neighbor Ising ferromagnet 
where some exponents are known exactly (e.g. u>2 = |, u>2n = nu2, or v = 1 while 5, V are 
not rigorously known). Nevertheless for Ising ferromagnets (even not nearest neighbor but, 
as always here, finite range) in all dimensions all mentioned exponents are conjectured to 
be the same as those of the nearest neighbor Ising ferromagnet. A further exception is the 
derivation of rigorous relations between critical exponents and, in some cases, even their 
values under the assumption that they exist. 

Remark: Naively it could be expected that in a pure state in field with (a x ) = the 
quantity s = |A|~2 J2 xeA o~ x , if A is a cubic box of side £, should have a probability 
distribution which is Gaussian with dispersion liniA^oo (s 2 )- This is "usually true": but 
not always. The (23.2) shows that in the d = 2 ferromagnetic nearest neighbor Ising 
model (s 2 ) diverges proportionally to £ 2 ~^ so that the variable s cannot have a Gaussian 
distribution. The variable S = |A|~I J2 x ga (7 x will have a finite dispersion: however there 
is no reason that it should be Gaussian. This makes clear the great interest of a fluctuation 
theory and its relevance for the critical point studies, see Sect. 24, 25. 

Bibliography: [On44], [DG72], [MW73], [Ai82]. 
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24. Fluctuations 



As it appears from Sect. 23 fluctuations of observabes around their averages have 
interesting properties particularly at critical points. Of particular interest are observables 
which are averages over large volumes A of local functions F(x) on phase space: this 
is so because macroscopic observables often have this form. For instance given a region 
A inside the system container fi, A C fi, consider a configuration x = (P, Q) and the 
number of particles N\ = X] q eA 1 m A, or the potential energy $a = J2(q q')eA ~~ 
or the kinetic energy K\ = X] q eA 2m P 2 - I* 1 the case of lattice spin systems consider a 
configuration a and, for instance, the magnetization Ma = XlieA ai m ^- Label the above 
four examples by a = 1, . . . , 4. 

Let \i a be the probability distribution describing the equilibrium state in which the 

quantities A a are considered; let xa = ifjtf) an d p d — Xa \ a"| XA • Then typical fluctuations 
properties to investigate are (a = 1 ... 4) , 

(1) for all S > z£ z's liniA^oo A*a(bl > 5) = flaw o/ /arge numbers) 

(2) t/iere is D a > swc/i i/ia£ n{p^/\K\ e [a, 6]) A ^ oo > J a -/^=g e~ 2P " (central limit law) 

(3) i/iere z's an interval I a = (p* , + ) and a concave function F a (p), p <E I, such that if 
[a, 6] C / i/ien m log/i(p G [a, 6]) A ^ oo > max pe [ a &] F a (p) (large deviations law) 

The law of large numbers provides the certainty of the macroscopic values; the central 
limit law controls the small fluctuations (of order a/|A[) of Aa around its average; the 
large deviations law concerns the fluctuations of order |A|. 

The relations (1),(2),(3) are not always true: they can be proved under further general 
assumptions if the potential cp satisfies (7.2) in the case of particle systems or if \<p(q) \ < 
oo in the case of lattice spin systems. The function F a (p) is defined in terms of the 
thermodynamic limits of suitable thermodynamic functions associated with the equilibrium 
state /j, a . The further assumption is, essentially in all cases, that a suitable thermodynamic 
function in terms of which the F a (p) will be expressed is smooth and with a nonvanishing 
second derivative. 

For the purpose of a simple concrete example consider a lattice spin system of Ising 
type with energy — ye u f( x — y)o~ x o~ y — Ylx ha x and the fluctuations of the magneti- 
zation Ma = XLeA o'x, A C O, in the grand canonical equilibrium states f^h,p- 

Let the free energy be (3f(f3, h), see (15.5), let m = m(h) d = (-y^y) and let h(m) the 
inverse function of m(h). If p = the function F{p) is 

F(p) = (3{f((3, hip)) - /(/?, h) - d h f(P, h)(h(p) - h)) (24.1) 
then a quite general result is 

The relations ( 1),( 2), (3) hold if the potential satisfies J2 X \<fi(x)\ < oo and if F(p), (24-1), is 
smooth and F"(p) ^ in open intervals around those in which p is considered, i.e. around 
p = for the law of large numbers and for the central limit law or in an open interval 
containing a, b for the case of the large deviations law. 
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In the envisaged cases the theory of equivalence of the ensembles implies that the 
function F can also be computed via thermodynamic functions naturally associated with 
other equilibrium ensembles. For instance instead of the grand canonical f(f3, h) one could 
consider the canonical (3g((3,m), see (15.5), then 

F(p) = -(3 {g(p, p) - g(p, m) - d m g(p, m) (p - m)) (24.2) 

It has to be remarked that there should a strong relation between the central limit law 
and the law of large deviations. Setting aside stating the conditions for a precise mathe- 
matical theorem, the statement can be rapidly illustrated in the case of a ferromagnetic 
lattice spin system and with A = O, by showing that the large deviations law in small 
intervals, around the average m(ho), at a value ho of the external field is implied by the 
validity of the central limit law for all values of h near h and viceversa (here (3 is fixed). 
Taking ho = (for simplicity) the heuristic reasons are the following. Let Hh,ci be the 
grand canonical distribution in external field h. Then 

(1) The probability Hh,n{p £ dp) is proportional, by definition, to Ho,ci(p £ dp)e~ /3hm ^. 
Hence if the central limit law holds for all h near ho = there will exist two functions 
m(h) and D(h) > 0, defined for h near ho = 0, with m(0) = and 

fi (p e dp)e- f3hp = const e- ]m(P '^ )2+o(n) dp (24.3) 

(2) there is a function ((m) such that d m ((m(h)) = j3h and <9^C(m(/i)) = -D(/i) _1 . [This 
is obtained by remarking that, given D(h), the differential equation d m (3h = D{h)~ x with 
the initial value h(0) = determines the function him); therefore ((m) is determined by 
a second integration, from d m ((m) = j3h{m). 

It then follows, heuristically, that the probability of p in zero field has the form 
const e^ p )l°l dp so that the probability that p G [a, b] will be const e' n l max pe[a,b] C(p)_] 

Viceversa the large deviations law for p at h = implies the validity of the central 
limit law for the fluctuations of p in all small enough fields h: this simply arises from the 
function F(p) having a negative second derivative. 

This means that there is a '"duality" between central limit law and large deviation law 
or that the large deviations law is a "global version" of the central limit law. In the sense 
that 

(a) if the central limit law holds for h in an interval around ho then the fluctuations of the 
magnetization at field ho satisfy a large deviation law in a small enough interval J around 
m(h ), 

(b) if a large deviation law is satisfied in an interval around ho then the central limit law 
holds for the fluctuations of magnetization around its average in all fields h with h — ho 
small enough. 

Going beyond the heuristic level in establishing the duality amounts to giving a precise 
meaning to "small enough" and to discuss which properties of m{h) and D(h) or of F(p) 
are needed to derive properties (a),(b). 

For illustration purposes consider the Ising model with ferromagnetic interaction ip: 
then the central limit law holds for all h if j3 is small enough and, under the same condition 
on /?, the large deviations law holds for all h and all intervals [a, b] C (—1, 1). If (3 is not 
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small then the condition h ^ has to be added. Hence the conditions are fairly weak and 
the apparent exceptions concern the value h = and f3 not small where the statements 
may become invalid because of possible phase transitions. 

In presence of phase transitions the law of large numbers, the central limit law and law 
of large deviations should be reformulated. Basically one has to add the requirement that 
fluctuations are considered in pure phases and change in a natural way the formulation of 
the laws. For instance the large fluctuations of magnetization in a pure phase of the Ising 
model in zero field and large (3 (i. e. in a state obtained as limit of finite volume states with + 
or — boundary conditions) in intervals [a, b] which do not contain the average magnetization 
m* are not necessarily exponentially small with the size o/|A|: if [a, b] C [— m*,m*] they 
are exponentially small but only with the size of the surface of A (i.e. with | A | ) while 
they are exponentially small with the volume if [a, b] n [—m*, m*] = 0. 

The discussion of Sect. 23 shows that also at the critical point the nature of the large 
fluctuations is expected to change: no central limit law is expected to hold in general 
because of the example of (23.2) with the divergence of the average of the normal second 
moment of the magnetization in a box as the side tends to oo. 

Bibliography: [0187] 

25. Renormalization group 

The theory of fluctuations just discussed concerns only fluctuations of a single quantity. 
The problem of joint fluctuations of several quantities is also interesting and led to really 
new developments in the 1970's. It is necessary to restrict attention to rather special 
cases in order to illustrate some ideas and the philosophy behind the approach. Consider 
therefore the equilibrium distribution fx associated with one of the classical equilibrium 
ensembles. For instance the equilibrium distribution of an Ising energy function /3Hq, 
having included the temperature factor in the energy: the inclusion is done because the 
discussion will deal with the properties of hq as a function of (3. It will also be assumed 
that the average of each spin is zero ("no magnetic field", see (15.3) with h = 0). Wanting 
to keep in mind a concrete case imagine that (3Hq is the energy function of the nearest 
neighbor Ising ferromagnet in zero field. 

Imagine that the volume O of the container has periodic boundary conditions and is 
very large, ideally infinite. Define the family of blocks parameterized by £ G Z d and 
k integer, consisting of the lattice sites x = {k£i < Xi < (k + l)£i}. This is a lattice of 
cubic blocks with side size k that will be called the "/c-rescaled lattice". The quantities 
= k~ ad X] x efc£ °x are cane d the block spins. 

Given a define the map R* a k fiQ = transforming the initial distribution on the 
original spins into the distribution of the block spins. Note that if the initial spins have 

id id 

only two values cr x = ±1 the block spins take values between — -g^ and at steps of 
size -^Id . Furthermore the map R* a k makes sense independently of how many values the 
initial spins can assume, and even if they assume a continuum of values S x G M. 

Taking a = 1 means, for k large, looking at the probability distribution of the joint 
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large fluctuations in the bloks k£. Taking a = | corresponds to studying a joint central 
limit property for the block variables. 

Considering a one parameter family of initial distributions fiQ parameterized by a 
parameter (3 (that will be identified with the inverse temperature), typically there will be 
a unique value a((3) of a such that the joint fluctuations of the block variables admit a 
limiting distribution, 

rih) 

prob fc (m£ G [at, 6 € ], a G A) / y A ((^) €eA ) jj d5 € (25.1) 

for some distribution g\(z) on M A . 

If a > a{j3) the limit will then be n^eA^(^) dS^, or if a < a((3) the limit will not 
exist (because the block variables will be too large, with a dispersion diverging as k — > oo). 

It is convenient to choose as sequence of k — > oo the sequence k = 2 n with n = 0, 1, . . . 
because in this way it is R* a k = i?*^ and the limit k — > oo along the sequence k = 2 n can 
be regarded as a sequence of iterations of a map i?* x acting on the probability distributions 

of generic spins S x on the lattice Z d (the sequence 3 n would be equally suited). 

It is even more convenient to consider probability distributions that are expressed in 
terms of energy functions H which generate, in the thermodynamic limit, a distribution fx: 
then i?* 1 defines an action R a on the energy functions so that R a H — H' if H generates 
fi, H' generates fx' and i?* 1 fi = fi' . Of course the energy function will be more general 
than (15.3) and a form like 8U in (19.2) has to be admitted, at least. 

In other words R a gives the result of the action of R& i expressed as a map acting 
on the energy functions. Its iterates define also a semigroup which is called the block spin 
renormalization group. 

While the map R* a l is certainly well defined as a map of probability distributions into 
probability distributions, it is by no means clear that R a is well defined as a map on the 
energy functions. Because if ii is given by an energy function it is not clear that R* a ^fi is 
such. 

A remarkable theorem can be (easily) proved when R* a 1 and its iterates act on initial 
/Uo's which are equilibrium states of a spin system with short range interactions and at 
high temperature ((3 small). In this case if a = h the sequence of distributions R*^ 1 1 A*o(/?) 

2 ' 1 

admits a limit which is given by a product of independent Gaussians 



prob fc (m^ G [o € ,6 € ], ^A)^ / * J] e"^W S l J] ? £l= (25.2) 



Note that this theorem is stated without even mentioning the renormalization maps i??: 

2 

it can nevertheless be interpreted as saying that 
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but the interpretation is not rigorous because (25.2) does not even say that R\Hq{(3) 

2 

makes sense for n > 1. The (25.2) says that at high temperature block spins have normal 
independent fluctuations: it is therefore an extension of the central limit law. 

There are a few cases in which the map R a can be rigorously shown to be well defined 
at least when acting on special equilibrium states like the high temperature lattice spin 
systems: but these are exceptional cases of relatively little interest. 

Nevertheless there is a vast literature dealing with approximate representations of the 
map R a . The reason is that assuming not only its existence but also assuming that it 
has the properties that one would normally expect to hold for a map acting on a finite 
dimensional space it follows that a number of consequences can be drawn, quite non trivial 
as they led to the first theory of the critical point that goes beyond the van der Waals 
theory of Sect.ll. 

The argument proceeds essentially as follows. At the critical point the fluctuations are 
expected to be anomalous (c.f. the last remark in Sect. 23) in the sense that (( ^f= ) ) 



will tend to oo, because a = \ does not correspond to the right fluctuation scale of 
S?eA s ig nann g that RV^^iflc) will not have a limit but, possibly, there is a c > \ such 

that R^ n c l no(/3 c ) converges to a limit in the sense (25.1) (a c = | in the case of the critical 
nearest neighbor Ising ferromagnet, see ending remark in Sect. 23). 

Therefore if the map R* a l is considered as acting on Ho{(3) it will happen that for all 
(3 < P c , R*£ 1/ uo(/9 c ) will converge to a trivial limit n^eA^^) dS^ because the value a c 
is greater than | while normal fluctuations are expected. 

If the map R ac can be considered as a map on the energy functions this says that 
n^eA^(^)^^ i s a "(trivial) fixed point of the renormalization group" which "attracts" 
the energy functions (3Hq corresponding to the high temperature phases. 

The existence of the critical (3 C can be associated with the existence of a non trivial 
fixed point H* for R ac which is hyperbolic with just one Lyapunov exponent A > 1, hence it 
has a stable manifold of codimension 1. Call fi* the probability distribution corresponding 
to H*. 

The migration towards the trivial fixed point for (3 < f3 c can be simply explained by the 
fact that for such values of (3 the initial energy function (3H is outside the stable manifold 
of the non trivial fixed point and under application of the renormalization transformation 
R™ c (3H migrates toward the trivial fxed point, which is attractive in all directions. 

Increasing (3 it may happen that, for (3 = (3 C , (3H crosses the stable manifold of 
the non trivial fixed point H * for i? Qc . Then i?^ c PcHq will no longer tend to the trivial 
fixed point but it will tend to H*: this means that the block spin variables will exhibit a 
completely different fluctuation behavior. If (3 is close to (3 C the iterations of R CCc will bring 
R™ c f3Ho close to H* , only to be eventually repelled along the unstable direction reaching a 
distance from it increasing as \ n \f3 — (3 C \. 

This means that up to a scale length 0(2 n ^' ) ) lattice units with X n< -^\/3 — (3 C \ = 1, 
i.e. up to a scale 0(\(3 — (3 c \~ log2 A ) the fluctuations will be close to those of the fixed point 
distribution fx*, but beyond that scale they will come close to those of the trivial fixed 
point: to see them the block spins would have to be normalized with index a = \ and 
they would appear as uncrorrelated Gaussian fluctuations, c.f. (25. 2), (25. 3). 
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The next question is "how to find the non trivial fixed points?" this means how to 
find the energy functions H* and the corresponding a c which are fixed points of R ac - If 
the above picture is correct the distributions \i* corresponding to the H* would describe 
the critical fluctuations and if there was only one choice, or a limited number of choices, 
of a c and H* this would open the way to a universality theory of the critical point hinted 
already by the "primitive" results of van der Waals' theory. 

The initial hope was, perhaps, that there would be a very small number of critical 
values a c and of H* possible: but it rapidly faded away leaving however the possibility 
that the critical fluctuations could be classified into universality classes. Each class would 
contain many energy functions which upon iterated actions of R ac would evolve under the 
control of the trivial fixed point (always existing) for f3 small while for f3 = f3 c they would 
be controlled, instead, by a non trivial fixed point H* for R ac with the same a c and the 
same H* . For (3 < (3 C a "resolution" of the approach to the trivial fixed point would be 
seen by considering the map Ri rather than R ac whose iterates would, however, lead to a 
Gaussian distribution like (25.2) (and to a limit energy function like (25.3)). 

The picture is highly hypothetical: but it is the first suggestion of a mechanism 
leading to critical points with character of universality and with exponents different from 
those of the van der Waals' theory or, for ferromagnets on a lattice, from those of its lattice 
version (the Curie- Weiss ' theory) : furthermore accepting the approximations, e.g. Wilson- 
Fisher's e-expansion, that allow one to pass from the well defined R* a l to the action of 
R a on the energy functions it is possible to obtain quite unambiguously values for a c and 
expressions for H* which are associated with the action of R ac on various classes of models. 

For instance it can lead to conclude that the critical behavior of all ferromagnetic 
finite range lattice spin systems (with energy functions given by (15.3)) have critical points 
controlled by the same a c and the same non trivial fixed point: this property is far from 
being mathematically proved but it is considered a major success of the theory. One has 
to compare it with van der Waals critical point theory: for the first time an approximation 
scheme has led, even though under approximations not fully controllable, to computable 
critical exponents which are not equal to those of the van der Waals theory. 

The renormalization group approach to critical phenomena has many variants, de- 
pending on which kind of fluctuations are considered and on the models to which it is 
applied. In statistical mechanics there are a few mathematically complete applications: 
certain results in high dimension, theory of dipole gas in d = 2, hierarchical models, some 
problems in condensed matter and in statistical mechanics of lattice spins and a few oth- 
ers. Its main mathematical successes have been obtained in various related fields where 
not only the philosophy described above can be applied but it leads to renormalization 
transformations that can be defined precisely and studied in detail: e.g. constructive field 
theory, KAM theory of quasi periodic motions, various problems in dynamical systems. 

However the applications always concern special cases and in each of them the general 
picture of the trivial-nontrivial fixed point dichotomy appears realized but without being 
accompanied except in rare cases (like the hierarchical models or the universality theory 
of maps of the interval) by the full description of stable manifold, unstable direction and 
action of the renormalization transformation on objects other than the one of immediate 
interest (a generality which looks often an intractable problem, but which also turns out 



2//ebbraio/2008; 14:37 



45 



to be not necessary), 

In the renormalization group context mathematical physics has played an important 
role also by providing clear evidence that universality classes could not be too few: this was 
shown by the numerous exact solutions after Onsager's solution of the nearest neighbor 
Ising feromagnet: there are in fact several lattice models in d = 2 which exibit critical 
points with some critical exponents exactly computable and which depend continuously on 
the models parameters. 

Bibliography: [MW73], [Ba82], [BS75], [WF72], [GK83], [GK85], [BG95], [Ma04]. 
26. Quantum statistics 

Statistical mechanics is extended to assemblies of quantum particles rather straight- 
forwardly. In the case of N identical particles the observables are operators O on the 
Hilbert space Hn = L 2 (0)^, or Hn = (L 2 (£l) <8> C 2 )^, where « = +,—, of the symmet- 
ric (a = +, bosonic particles) or antisymmetric (a = — , fermionic particles) functions 
V>(Q), Q = (qi, • • • , qjv) of the position coordinates of the particles or of the positions- 
spin coordinates t/>(Q,<t), cr = (<ti, . . . , ctn), normalized so that f IV'(Q) \ 2 dQ = 1 or 
So- 1 IV'(Q) c)| 2 ^Q = 1, here only aj = ±1 is considered. As in classical mechanics a 
state is defined by the average values (O) that it attributes to the observables. 

Microcanonical, canonical and grand canonical ensembles can be defined quite easily. 
For instance consider a system described by the Hamiltonian (h = Planck's constant) 

h 2 N 

H » = E A ^ + E <p(& - + E^) = K + $ ( 2fU ) 

5=1 3<j' j 

where periodic boundary conditions are imagined on O and w(q) is periodic smooth po- 
tential (the side of Q is supposed a multiple of the periodic potential period if w ^ 0). 
Then a canonical equilibrium state with inverse temperature (3 and specific volume v = jj 
attributes to the observable O the average value 

def Tre-^O 
[ ' Tr 

Similar definitions can be given for the grand canonical equilibrium states. 

Remarkably the ensembles are orthodic and a "heat theorem", see Sect. 4, can be 
proved. However "equipartition" does not hold: i.e. (K) ^ ^N(3~ x although (3~ x is still 
the integrating factor of dU+pdV in the heat theorem; hence keeps being proportional 
to temperature. 

Lack of equipartition is important as it solves paradoxes that arise in classical statis- 
tical mechanics applied to systems with infinitely many degrees of freedom, like crystals 
(modeled by lattices of coupled oscillators) or fields like the electromagnetic field (black 
body radiation). However although this has been the first surprise of quantum statistics 
(and in fact responsible for the very discovery of quanta) it is by no means the last. 
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At low temperatures new unexpected (i.e. with no analogue in classical statistical 
mechanics) phenomena occurr: Bose-Einstein condensation (superfluidity), Fermi surface 
instability (superconductivity) and appearance of off-diagonal long range order, with the 
acronym ODLRO, will be selected to illustrate the deeply different kind of problems of 
quantum statistical mechanics. Largely not yet understood, such phenomena pose very 
interesting problems not only from the physical point of view but also from the mathemat- 
ical point of view and may pose challenges even at a definitory level. However it should 
be kept in mind that in the interesting cases (i.e. 3-dimensional systems and even most 2 
and 1 dimensional systems) there is no proof that the objects defined here really exist for 
the systems like (26.1), see however the final comment for an important exception. 

(1) Bose Einstein condensation: In a canonical state with parameters /?, v a definition of 
the occurrence of Bose condensation is in terms of the eigenvalues i>j(Q, N) of the kernel 
p(q, q') on L 2 (H), called the 1-particle reduced density matrix, defined by 

N r Tre -p HN / V>n(q, Qi> • • • , Qiv-OV'ntq', qi, • • • , qjv-i)dqi • • • qjv-i (26.3) 

n=l 

where E n (Q,N) are the eigenvalues of and VVi(qi) . . . , qjv) are the corresponding 
eigenfunctions. If Vj are ordered by increasing value the state with parameters /?, v is said 
to contain a Bose-Einstein condensate if N) > b N > for all large Cl at v = jf,P 
fixed. This receives the interpretation that there are more than bN particles with equal 
momentum. The free Bose gas has a Bose condensation phenomenon at fixed density and 
small temperature. 

(2) Fermi surface: the wave functions V'n(qi) °i> . . . , qj\r, o~n) = V ; n(Q 5 cr ) are now an- 
tisymmetric in the permutations of the pairs (q^a^). Let ip(Q,tr; N,n) denote the n- 
th eigenfunction of the N particles energy in (26.1) with eigenvalue E(N,n) (la- 
beled by n = 0,1,... and nondecreasingly ordered). Setting Q" = (q", . . . , q^_ p ), 
a" = (cr'/, . . . , o-%_ p ), introduce the kernels p^ JV (Q, cr; Q', cr') by 



which are called p-particles reduced density matrices (extending the corresponding one 

def 

particle reduced density matrix (26.3)). Denote p(qi — q 2 ) = Pi(qi) q2 5 cr)- It is also 
useful to consider spinless fermionic systems: the corresponding definitions are obtained 
simply by suppressing the spin labels and will not be repeated. 

Let ri (k) be the Fourier transform of pi(q — q'): the Fermi surface can be defined as 
the locus of the k's in the neighborhood of which <9kri(k) is unbounded as O — > oo, (3 — > oo. 
The limit as f3 — > oo is important because the notion of a Fermi surface is, possibly, precise 
only at zero temperature, i.e. (3 = oo. 
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• V(Q, <t; Q", a"; N, n)^(Q', <r'; Q ", cr" ■ N, n) 



So far existence of Fermi surface (i.e. the smoothness of ri(k) except on a smooth 
surface in k space) has been proved in free Fermi systems (ip = 0) and 

(a) certain exactly soluble 1-dimensional spinless systems 

(b) in rather general one dimensional spinless systems or systems with spin and repulsive 
pair interaction, possibly in an external periodic potential. 

The spinning case in a periodic potential and dimension d > 2 is the most interesting 
case to study for its relevance in the theory of conduction in crystals. Essentially no 
mathematical results are available as the above mentioned ones do not concern any case 
in dimension > 1: this is a rather deceiving aspect of the theory and a challenge. 

In dimension 2 or higher, for fermionic systems with Hamiltonian (26.1), not only 
there are no results available, even without spin, but it not even clear that a Fermi surface 
can exist in presence of interesting interactions. 

(3) Cooper pairs: superconductivity theory has been phenomenologically related to the 
existence of Cooper pairs. Consider the Hamiltonian (26.1) and define, c.f. (26.4), p(x — 

def 

y, cr; x' - y , a'; x - x') = p 2 (x, a, y, -a; x , a', y , -a'). 

The system is said to contain Cooper pairs with spins a, —a (a = + or a = — ) if there 
exist functions g a (q, a) ^ with J g Q (q, o-)g a (q, a)dq = if a ^ a', such that 



lim p(x - y, a, x' - y', a', x - x') — - j— > ]T £ a (x - y, <r)g<*& - y', a') (26.5) 

Oi 

In this case g a (x — y, a) with largest norm can be called the wave function of the paired 
state of lowest energy: this is the analogue of the plane wave for a free particle (and, like 
it, it is manifestly not normalizable, i.e. it is not square integrable as a function of x, y). 
If the system contains Cooper pairs and the non leading terms in the limit (26.5) vanish 
quickly enough the 2-particles reduced density matrix, (26.5), regarded as a kernel operator 
has an eigenvalue of order V as V — > oo: i.e. the state of lowest energy is macroscopically 
occupied, quite like the free Bose condensation on the ground state. 

Cooper pairs instability might destroy the Fermi surface in the sense that ri(k) be- 
comes analytic in k; but it is also possible that even in presence of the instability a surface 
remains which is the locus of the singularities of the function r\ (k) . In the first case there 
should remain a trace of it as a very steep gradient of ri(k) of the order of an exponential 
in the inverse of the coupling strength; this is what happens in the BCS model for super- 
conductivity: the model is however a mean field model and this particular regularity aspect 
might be one of its peculiarities. In any event a smooth singularity surface is very likely 
to exist for some interesting density matrix (e.g. in the BCS model with "gap parameter 
7" the wave function o(x — y, a) = f n wn e tk '( x ~ y ) , 7 = dk of the lowest energy 

' v\ Ji J (2^)" Je(k)>0 v /e(k) 2 +7 2 

level of the Cooper pairs is singular on a surface coinciding with the Fermi surface of the 
free system). 

(4) ODLRO: Consider the /c-fermions reduced density matrix p fc (Q, cr; Q', cr') as kernel 
operators Ok on L 2 (0 x C 2 )^. Suppose k even, then if Ok has a (generalized) eigenvalue 
of order iV§ asiV^oo, y = p, the system is said to exhibit off diagonal long range order 
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of order k. For k odd ODLRO is defined to exist if Ok has an eigenvalue of order N^ k ^ 
and k > 3 (if k = 1 the largest eigenvalue of 0\ is necessarily < 1). 

For bosons consider the reduced density matrix p/c(Q; Q') regarding it as a kernel op- 
erator Ok on £2(0)^ and define ODLRO of order k to be present if 0{k) has a (generalized) 
eigenvalue of order N k as N — > 00, y = p. 

The notion of ODLRO can be regarded as a unification of the phenomena of Bose 
condensation and of existence of Cooper pairs because Bose condensation could be said to 
correspond to the kernel operator pi(qi — q2) in (26.3) having a (generalized) eigenvalue 
of order N, and to be a case of ODLRO of order 1. If the state is pure in the sense that it 
has a cluster property, see Sect. 9, 15, then ODLRO, Bose condensation and Cooper pairs 
existence imply that the system shows a spontaneously broken symmetry: conservation of 
particle number and clustering imply that the off-diagonal elements of (all) reduced density 
matrices vanish at infinite separation in states obtained as limits of states with periodic 
boundary conditions and Hamiltonian (26.1), and this is incompatible with ODLRO. 

The free Fermi gas has no ODLRO, the BCS model of superconductivity has Cooper 
pairs and ODLRO with k = 2, but no Fermi surface in the above sense (possibly too 
strict). Fermionic systems cannot have ODLRO of order 1 (because the reduced density 
matrix of order 1 is bounded by 1). 

The contribution of Mathematical Physics has been particularly effective by providing 
exactly soluble models: however the soluble models deal with 1-dimensional systems and 
it can be shown that in dimension 1, 2 no ODLRO can take place. A major advance is the 
recent proof of ODLRO and Bose condensation in the case of a lattice version of (26.1) at 
a special density value (and d > 3) . 

In no case, for the Hamiltonian (26.1) with ip 7^ 0, existence of Cooper pairs has been 
proved nor existence of a Fermi surface for d > 1. Nevertheless both Bose condensation and 
Cooper pairs formation can be proved to occurr rigorously in certain limiting situations. 
There are also a variety of phenomena (e.g. simple spectral properties of the Hamiltonians) 
which are believed to occurr once some of the above mentioned four do occurr and several 
of them can be proved to exist in concrete models. 

If d = 1, 2 the ODLRO can be proved to be impossible at T > through the use of 
Bogoliubov's inequality (used in the "no d = 2 crystal theorem", see Sect. 13). 

Bibliography: [P056],[Ya62], [Ru69], [Ho67], [Ga99], [ALSSY]. 

Appendix Al. The Physical meaning of the stability conditions 



It is therefore useful to see what would happen if the conditions of stability and 
temperedness (see (7.2)) are violated. The analysis also illustrates some of the typical 



2//ebbraio/2008; 14:37 



49 



methods of statistical mechanics. 



(a) Coalescence catastrophe due to short distance attraction: the simplest violation of the 
first condition in (7.2) occurs when the potential ip is smooth and negative at the origin. 

Let 5 > be so small that the potential at distances < 26 is < — b < 0. Consider 
the canonical distribution with parameters /?, iV in a (cubic) box O of volume V. The 
probability P co iia P se that all the N particles are located in a little sphere of radius 5 
around the center of the box (or around any pre-fixed point of the box) is estimated from 
below by remarking that $ < — b{^) ~ — |iV 2 so that 



I 3 

r rfpdq r -0(K(p)+$(q)) / 47ry2m/3-i ^ jy S 3N 0b\ N( N- 1) 

p _ JC K iN N\ ti -> I 3h'-> I N\ C ( a-] -\\ 

^collapse dpdq MK{ )+H )} ^ r dq fl»( q ) 

Phase space is extremely small: nevertheless such configurations are far more probable 
than the configurations which "look macroscopically correct", i.e. configurations with par- 
ticles more or less spaced by the average particle distance expected in a macroscopically 
homogeneous configuration, namely (y) _1/ ' 3 = . Their energy <fr(q) is of the order of 
uN for some u, so that their probability will be bounded above by 

r dpdq 0( K (p)+uN) V N ^/2mf3-^ uN 

p < J h 3N N\ c _ h 3N N\ c 1 4-1 o\ 

^regular _ . _dpdq_ /3(K (p)+*(q)) f dq 3 ^( a ) 

However ; no matter how small S is, the ratio p reg "' ar will approach OasF^oo, ^ — > t; -1 ; 

collapse * 

extremely fast because e^ bN I 2 eventually dominates over V N ~ e ^iog^_ 

Thus it is far more probable to find the system in a microscopic volume of size 5 rather 
than in a configuration in which the energy has some macroscopic value proportional to N. 
This catastrophe can be called an ultraviolet catastrophe (as it is due to the behavior at very 
short distances) and it causes the collapse of the particles into configurations concentrated 
in regions as small as we please as V — > oo. 



(b) Coalescence Catastrophe due to Long-Range Attraction. It occurs when the potential 
is too attractive near oo. To simplify suppose that the potential has a hard core, i.e. it 
is +00 for r < ro, so that the above discussed coalescence cannot occur and the system 
density bounded above by a certain quantity p cp < 00 (close packing density). 

The catastrophe occurs if </?(q) ~ — (7|q|~ 3+e , g,e > 0, for |q| large. For instance this 
is the case of matter interacting gravitationally; if k is the gravitational constant, m is the 
particles mass then g = km 2 and e = 2. 

The probability P reg uiar of "regular configurations" , where particles are at distances 
of order p -1 / 3 from their close neighbors, is compared with P co iiapse of "catastrophic 
configurations", with the particles at distances ro from their close neighbors to form a 
configuration of density p cp /(l + 6~) s almost in close packing (so that ro is equal to the 
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hard core radius times 1 + 5). In the latter case the system does not fill the available 
volume and leaves empty a region whose volume is a fraction ~ f^v-f y Q f y ^nd ^ can 

Pep 

be checked thatthe ratio P reg uiar / Pcoiiapse tends to at a rate 0(e 9 ^ JV (Pc P (i+<5) if S 

is small enough (and p < p cp ). 

A system which is too attractive at infinity will not occupy the available volume but 
will stay confined in a close packed configuration even in empty space. 

This is important in the theory of stars: stars cannot be expected to obey "regular 
thermodynamics" and in particular will not "evaporate" because their particles interact 
via the gravitational force at large distance. Stars do not occupy the whole volume given 
to them (i.e. the universe); they do not collapse to a point only because the interaction 
has a strongly repulsive core (even when they are burnt out and the radiation pressure is 
no longer able to keep them at a reasonable size, a reasonable size). 

(c) Evaporation Catastrophe: this is a another infrared catastrophe, i.e. a catastrophe due 
to the long-range structure of the interactions like (b) above; it occurs when the potential 
is too repulsive at oo: i.e. <£>(q) ~ +fi'|q| _3+£ as q — > oo so that the temperedness condition 
is again violated. 

Also in this case the system does not occupy the whole volume: it will generate a layer 
of particles sticking, in close packed configuration, to the walls of the container. Therefore 
if the density is lower than the close packing density, p < p cp: the system will leave a region 
around the center of the container O empty; and the volume of the empty region will still 
be of the order of the total volume of the box (i.e. its diameter will be a fraction of the 
box side L). The proof is completely analogous to the one of the previous case; except 
that now the configuration with lowest energy will be the one sticking to the wall and close 
packed there, rather than the one close packed at the center. 

Also this catastrophe is important as it is realized in systems of charged particles bear- 
ing the same charge: the charges adhere to the boundary in close packing configuration, 
and dispose themselves so that the electrostatic potential energy is minimal. Therefore 
charges deposited on a metal will not occupy the whole volume: they will rather form a 
surface layer minimizing the potential energy (i.e. so that the Coulomb potential in the 
interior is constant). In general charges in excess of neutrality do not behave thermody- 
namically: for instance, besides not occupying the whole volume given to them, they will 
not contribute normally to the specific heat. 

Neutral systems of charges behave thermo dynamically if they have hard cores so that 
the ultraviolet cathstrophe cannot occur or if they obey quantum mechanical laws and 
consist of fermionic particles (plus possibly bosonic particles with charges of only one 
sign). 

Bibliography: [DL67], [LL72], [LiOl] 
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Appendix A2. The subadditivity method 



A simple consequence of the assumptions is that the exponential in (5.2) can be 
bounded above by e ^ BN e~^ 2-,»=i p * so that 

1 < Z gc (P, A, V) < e Ve^y/^F^ d ^ 

1 d ( A2A ) 

< — log Z gc (P, A, V) < e f3x e /3B \[2m(P^ 

Consider for simplicity the case of a hard core interaction with finite range, c.f. 
(7.2). Consider a sequence of boxes Q n with sides 2 n Lo where Lq > is arbitrar- 
ily fixed > 2R. The partition function Z gc ((3, z) relative to the volume fi n is Z n = 
Sjv=o IvT fa. dPdQe -13 ®^ because the integral over the P variables can be explicitly 

performed and included in z N if z is defined as z = e^(2m/? -1 ) d ^ 2 . 
Then the box fi n contains 2 d boxes O n _i for m > 1 and 

1 < Z n < zl d _ 1 e' 3B2d ^-^ d ~ l22d (A2.2) 

because the corridor of width 2R around the boundaries of the 2 d cubes O n _i filling 
O n has volume 2RL n _ 1 2 d and contains at most (L n _i/ 'R) d ~ 1 2 d particles each of which 

interacts with at most 2 d other particles. Therefore f3p n d = L d logZ n < L^-^log Z n -i + 
/3S7 d 2- n (L / J R) d - 1 for some -f d > 0. Hence < (3p n < fip n - X + r d 2" n for some T d > 
and p n is bounded above and below uniformly in n: so that the limit (7.1) exists on the 
sequence L n = Lq2 u and defines a function (3p O0 {f3, A). 

A box of arbitrary size L can be filled with about (L/L^) d boxes of side with n so 
large that, prefixed 5 > 0, \poo — p n \ < S for all n > n. Likewise a box of size L n can be 
filled by about (L n /L) d boxes of size L if n is large. The latter remarks lead to conclude, 
by standard inequalities, that the limit in (7.1) exists and coincides with p^. 

The subadditivity method just demonstrated for finite range potentials with hard core 
can be extended to the potentials satisfying just stability and temperedness, c.f. Sect. 7. 

Bibliography: [Ru69], [Ga99]. 

Appendix A3. An infrared inequality 



The infrared inequalities stem from Bogoliubov's inequality. Consider as an example 
the problem of crystallization of Sect. 13. Let (■) denote average over a canonical equi- 

N 2 

librium state with Hamiltonian H = if + ^(Q) + £ ^(Q) with given temperature 

and density parameters p, p = a~ 3 . Let {X, Y} = ^j{d Pj X d q Y — d qj X d Pj Y) be the 
Poisson bracket. Integration by parts, with periodic boundary conditions, yields 
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m c,H } ) S - L^^ s . r ^ A .,Ci) (A3,) 
as a general identity. The latter identity implies, for A = {C, H}, 

({H, C}*{H, C}) = -/T 1 {H, C*}}) (A3.2) 

Hence the Schwartz inequality (A*A)({H, C}*{H, C}) > \({A*,C})\ 2 combined with the 
two relations in (A3. 1), (A3. 2) yields Bogoliubov's inequality 

Let g, h be arbitrary complex (differentiable) functions and d j = d qj 

N N 

A(Q)= f $>(q,), C(P,Q) d = / X)PiM^) (^.4) 

J=l 3=1 

Then if = £±p2 + $(q 1 ,...,q„),if$(q 1 ,...,q JV ) = | E^' ¥>( | Qj - <Jj' | ) + £ £j ^ (Qj ) , 

def 

so that {C,if"} = J2j(hjd j& — Pj(pj • d j)hj) with hj = h(qj). If h is real valued 
<{C, {C*, #}}> becomes (£ . ., ty^i ' »i'*(Q)> +<e E, ^^-) + | E,^ A) 2 > 
(integrals on just replace p 2 by 2f3~ 1 and ((Pj)i(Pj')i') = /3 _1 ^,i')- Therefore the 
average ({C, {C*, H}}) becomes 

<5X>; - V) 2 Mlq* -Qi'l) + e$>jAW(q;) + 4/r 1 £>A0 2 > (^3.5) 

ii' i j 



Choose g(q) = e l ( K + K )-q 5 fo(q) = cosq-K and bound (hj — hj/) by k 2 (q^— q.j>) , (d jhj)" 

2 
j 



by k 2 and /i 2 by 1. Hence (A3. 5) is bounded above by ND(k) with 



D( K ) d = / (/, 2 (4/3- 1 + ^E^-^') 2 l A ^-q,')l) +e^\AW(q j )\) (A3.6) 

This can be used to estimate the denominator in (A3. 3). For the l.h.s. remark that 
(A*, A) = lEf^e-^+K)! 2 and |<{A*,C}>| 2 = |<EA^)| 2 = |K + „| 2 iV 2 (p e (K) + 
p e (K + 2k)) 2 , hence (A3. 3) becomes after multiplying both sides by the auxiliary function 
7(k) (assumed even and vanishing for \k\ > ^) and summing over k, 

^E^lX>-~P> > ^ E7(K) g (^(K) + MK + 2 K )f 

(A3.7) 
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To apply (A3. 3) the averages in (A3. 6), (A3. 7) have to be bounded above: this is a technical 
point that is discussed here as it illustrates a general method of using the results on the 
thermodynamic limits and their convexity properties to obtain estimates. 

Note that (^Ek7(k)^k|Ef=ie- lk ^ | 2 ) is identically £(0)+ jr(^j<j' ft* ~ <fc')> 
with £(q) d = J2k 7(«)e iK ' q - 

Let v9 AiC (q) = ; V p(q) + Aq 2 |A( /? (q)|+r ? ^(q) and let F V (X, 77, C) = f j? logZ c (A, 77, () with 
Z c the partition function in the volume O, computed with energy U' = Pxxity ~ 

+ £ J2j W(Qj') J rV £ Yl | A.W(q^-)| . Then Fy(\, 77, £) is convex in A, 77 and it is uniformly 
bounded above and belowif \rj\, \e\, \C\ < 1 (say) and |A| < Ao: here Ao > exists if r 2 |A<^(r)| 
satisfies the assumption set at the beginning of Sect. 13 and the density is smaller than a 
close packing (this is because the potential U' will still satisfy conditions similar to (7.2) 
uniformly in |?7| < 1 and |A| small enough). 

Convexity and boundedness above and below in an interval imply bounds on the 
derivatives in the interior points: in this case on the derivatives of Fy with respect to 
A, 77, C at 0. The latter are identical to the averages in (A3. 6), (A3. 7). In this way the 
constants Bi, B 2 , B such that D(n) < n 2 E>i + eB 2 and B > Di are found. 



Bibliography: [Me68]. 
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